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Abstract 

This  paper  studies  a  dynamic,  quantity  setting  duopoly  game  characterized  as  follows: 
Each  firm  produces  an  indivisible  output  over  a  potentially  infinite  horizon,  facing  the  con- 
straint that  its  cumulative  production  cannot  exceed  an  initially  given  bound.   The  envir- 
onment is  otherwise  stationary;  the  remaining  productive  capacities  of  the  firms  at  any  mo- 
ment are  common  knowledge;  the  firms  choose  production  plans  contingent  on  these  capaci- 
ties which  are  mutual  best  responses  in  every  contingency.   The  resulting  Markov  Perfect 
Equilibria  are  analyzed  using  a  two— dimensional  backward  induction,  and  compared  with 
the  equilibria  which  emerge  when  precommitment  to  time  paths  of  output  is  possible.  It  is 
shown  that  the  ability  to  precommit  can  be  disadvantageous;  that  collusion  in  Markov 
Equilibrium  is  facilitated  by  the  symmetrical  placement  of  the  firms;  and  that  having 
greater  capacity  confers  basic  strategic  advantage  on  a  firm  by  enabling  it  to  credibly 
threaten  future  production.   The  model  solves  an  open  problem  in  the  theory  of  exhaustible 
resource  economics  by  imposing  subgame  perfection  in  a  resource  oligopoly  with  indepen- 
dent stocks.  It  also  formalizes  the  intuition  that,  when  indivisibilities  are  important,  tacit 
coordination  of  plans  so  as  to  avoid  destructive  competition  is  facilitated  by  establishing  a 
convention  of  "taking  turns"  —  that  is,  a  self— enforcing  norm  of  mutual,  alternate 
forebearance. 
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helpful  discussions;  and  the  Bradley  Foundation,  the  Olin  Foundation,  and  the  Center  for 
Energy  Policy  Research,  MIT,  for  research  support. 

**  This  paper  was  written  while  the  author  was  in  residence  as  a  Visiting  Scholar  at  the 
Department  of  Economics,  MIT. 


I.   Introduction 

This  paper  studies  tacit  collusion  in  a  dynamic  model  of  duopoly.   Firms  face  an 
indivisibility  in  production  at  any  moment.,  and  an  intertemporal  constraint  on  the 
cumulative  amount  of  output  which  is  possible  over  the  horizon  of  the  game.   The  analysis 
explores  the  strategic  implications  of  these  two  conditions:   On  the  one  hand,  due  to  the 
indivisibility,  a  firm  deciding  whether  to  produce  at  a  point  in  time  has  limited  flexibility 
to  accomodate  its  opponent's  action.   On  the  other  hand,  due  (say)  to  the  presence  of  an 
essential,  limited  and  non-reproducible  factor  of  production,  each  firm's  capacity  for  future 
production  is  progressively  diminished  by  decisions  it  has  taken  to  produce  the  past. 

We  study  the  Markov  Perfect  Equilibria  of  the  resulting  dynamic  game  when  each 
firm's  strategy  specifies  a  rate  of  production  at  any  moment  as  a  function  of  the  joint  cap- 
acities for  future  production  remaining  as  of  that  moment.   An  equilibrium  is  a  pair  of  such 
state— contingent  rules  which  are  optimal  for  each  firm,  given  the  other's  strategy,  from 
every  conceivable  state.   In  a  sense  our  game  is  a  "closed— loop",  "multiple  stock"  version  of 
the  "oil"igopoly  problem  encountered  in  the  exhaustible  resources  literature.1  This  is,  to 
our  knowledge,  the  first  treatment  of  perfect  equilibrium  in  a  depletable  resource  model 
with  multiple  stocks.   Previous  work  has  succeeded  in  imposing  subgame  perfection  only  in 
the  context  of  "common  pool"  situations,  in  which  there  is  a  single  limiting  resource.2 

The  assumed  indivisibility  in  production  further  distinguishes  the  present  analysis 
from  existing  work  in  resource  economics.   Our  firms  decide  whether  or  not  to  produce  at 
each  moment,  but  are  unable  to  vary  the  non-zero  rate  at  which  output  is  supplied.   They 
regulate  an  "on-off '  control  variable,  facing  a  constraint  on  the  total  amount  of  time  this 
variable  can  be  "on"  over  the  horizon  of  the  game.   In  effect,  their  output  decisions  are 


'See  Loury  (1986),  and  the  literature  cited  there. 

2An  early  contribution  was  the  study  of  competition  in  the  exploitation  of  a  common  fishery 
by  Levhari  and  Mirman  (1980).   Related  analyzes  include  Reinganum  and  Stokey  (1985V 
and  the  recent,  more  general,  work  of  Sundaram  (1989)  and  Dutta  and  Sundaram  (1989). 


qualitative  choices  about  participation.   The  game  is  analogous  to  a  dynamic  Prisoner's 
Dilemma  in  which  the  capacity  to  play  "tough"  in  the  future  diminishes  with  an  increased 
frequency  of  having  played  "tough"  in  the  past.3  We  focus  on  the  classic  question  in  the 
study  of  the  Prisoner's  Dilemma  —  the  extent  to  which,  in  a  non— cooperation  equilibrium, 
the  players  can  avoid  dissipative  competition  through  mutual  forebearance. 

Characterization  of  closed— loop  equilibrium  in  non— zero  sum  differential  games  with 
several  state  variables  is  a  very  formidable,  largely  open,  problem.   Explicit  solutions  are 
rare  outside  of  the  linear— quadratic  context,  though  a  number  of  economically  interesting 
examples  have  been  identified  with  a  special  structure  permitting  profitable  study.4  In 
this  paper  we  make  progress  by  simplifying  the  problem  in  two  ways:   As  noted,  we  limit 
the  set  of  actions  from  which  the  players  can  choose  at  any  instant;  and,  we  assume  that 
actions,  once  chosen,  must  remain  fixed  for  some  (possibly  quite  short)  interval  of  time. 
These  assumptions  permit  explicit  solutions  for  perfect  equilibria  to  be  obtained. 

Our  ability  to  solve  the  model  explicitly  is  exploited  to  pose  a  number  of  interesting 
questions  for  which  general  answers  are  unavailable  in  the  existing  literature  on  dynamic 
duopoly:   Are  firms  more,  or  less,  able  to  avoid  mutually  destructive  competition  when 
they  can,  and  do,  commit  at  the  initial  date  to  specific  future  actions,  relative  to  a  situa- 
tion in  which  such  commitment  is  not  possible?   What  is  the  relationship  in  equilibrium  be- 
tween the  size  and  distribution  of  payoffs,  on  the  one  hand,  and  capacities  to  produce,  on 
the  other?   How  will  the  distribution  of  remaining  capacity  evolve  over  time?   How  are 


3A  closely  related  model  is  the  continuous  time  Prisoner's  Dilemma  game  of  Rosenthal  and 
Spady  (1989).  There  Markov  Perfect  Equilibria  in  strategies  defined  on  a  two  dimensional 
state  space  of  cumulative  profits  are  studied. 

4A  good  discussion  of  the  differential  game  approach  to  dynamic  oligopoly  problems  is  in 
Fudenberg  and  Tirole  (1986).   Two  linear— quadratic  analyses  are  Fershtman  and  Kamien 
(1987),  who  study  a  deterministic,  one-state-variable  model  of  duopolistic  pricing,  and 
Hansen,  Epple  and  Roberds  (1985),  who  solve  a  stochastic  two-state-variable  model  of 
resource  depletion,  though  without  imposing  sugbgame  perfection.   Other  models  with  a 
structure  permitting  profitable  study  of  subgame  perfect  equilibrium  are  the  memoryless 
patent  race  of  Reinganum  (1981),  the  racing  model  of  Harris  and  Vickers  (1985),  the 
capacity  investment  model  of  Spence  (1979),  extended  in  Fudenberg  and  Tirole  (1983),  and 
the  duopoly  pricing  model  with  menu  costs  of  Halperin  (1990). 


these  considerations  affected  by  the  rate  of  discount  and  the  elasticity  of  market  demand? 

Among  the  results  obtained  are:  (1)  state— contingent  strategies  imply  equilibrium 
payoffs  at  least  as  great,  and  often  greater,  than  those  possible  with  open— loop  strategies; 
(2)  symmetry  of  capacities  facilitates  collusion  in  closed— loop  equilibrium,  but  can  under- 
mine it  when  open-loop  strategies  are  employed;  (3)   the  extent  of  endowments  consistent 
with  attaining  tacit  collusion  in  perfect  equilibrium  is  inversely  related  to  total  capacity,  to 
the  interest  rate,  and  to  the  elasticity  of  demand;  (4)  the  ability  to  credibly  threaten  future 
production  is  a  basic  strategic  advantage  which  allows  the  appropriation  of  a  more  than 
proportionate  share  of  the  industy's  total  payoff.   Result  (1)  implies  that  the  ability  to 
commit  at  the  initial  date  to  future  actions  can  be  disadvantageous!   Result  (2)  suggests 
that,  when  such  commitment  is  not  possible,  firms  are  better  able  to  maintain  high  prices 
to  their  mutual  advantage  if  neither  of  them  enjoys  a  large  market  share!   Result  (4)  pro- 
vides theoretical  support  for  the  common  intuition  that  a  firm  with  a  substantial  share  of 
total  capacity  (Saudi  Arabia  in  OPEC!)  can  disproportionately  influence  the  course  of  play. 

To  obtain  these  results  we  employ  the  two  simplifications  mentioned  to  transform  a 
complete  information,  continuous  time  game  of  dynamic  duopoly  into  a  more  tractable  dis- 
crete time  analogue.   This  conversion  is  described  in  section  II.   Perfect  equilibrium  in 
Markov  strategies  is  defined  in  section  III.   A  two-dimensional  backward  induction  is  em- 
ployed in  section  IV  to  solve  for  explicit  equilibria,  which  are  compared  to  the  open-loop 
Nash  equilibria  of  the  same  game.  In  section  V  we  consider  the  limit  of  equilibrium  behav- 
ior in  the  discrete  model  as  period  length  shrinks,  raising  some  technical  and  conceptual 
issues  discussed  briefly  in  that  and  the  concluding  section  of  the  paper.5 


5Because  of  the  indivisibility  in  production,  coordination  of  play  is  closely  linked  with  the 
idea  of  "precedence"  in  this  model.  Our  discrete  time  equilibria  exploit  this,  and  the  re- 
lated notion  of  "alternation"  heavily.  In  continuous  time,  when  players  are  free  to  change 
their  actions  at  any  instant,  it  is  nolonger  clear  how  to  define  who  produced  "last  period". 
Thus,  it  is  not  clear  how  to  interpret  the  limit,  as  period  length  goes  to  zero,  of  the  discrete 
time  equilibria  analyzed  in  section  IV.  See  the  treatment  of  extensive  form  games  in  con- 
tinuous time  by  Simon  and  Stinchcombe  (1989)  for  a  fuller  discussion  of  these  issues. 


H.   The  Model 

We  consider  the  following  model  of  competition  between  two  independent  firms,   A 
and   B,  facing  an  indivisibility  in  produc  ion,  with  given  limitations  on  their  cumulative 

capacities  to  produce.   Time  is  continuous  over  an  infinite  horizon,  re[0,oo).   At  date   r  the 

a  b 

flow  rates  of  production  of  firms  A  and   B,  denoted  by  q     and  q     respectively,  are  such 

a    b  2 

that   q  =(q  ,q  )e{0,l}  ;  that  is,  either  a  firm  is  inactive,  or  it  is  producing  at  a  unitary 

a      b 

rate  at  any  moment.    Q  =q  +q     is  the  industry's  production  rate  at    r;  obviously 

Q  e{0,l,2},  Vr>0.   A  firm's  initial  capacity  (or  stock)  is  a  number   R.  (j=A,B)   denoting 

feasible  cumulative  (resource  cost  of)  production  for  that  firm  over  the  course  of  the  game; 

2 
Re(Ra,Rr)gIR     is  the  vector  of  initial  capacities/stocks.   Capacity  falls  at  a  unitary  rate  if 

a  firm  is  active;  a  firm  exits  the  game  when  its  capacity  has  fallen  to  zero.   There  is  no  way 

to  add  to  stocks  during  the  play  of  the  game.  The  firms'  stocks  are  common  knowledge. 

The  demand  side  of  the  market  is  passively  modelled;  buyers  do  not  behave  strateg- 
ically.  There  is  an  inverse  demand  function,   P(-),  which  is  time  invariant  and  dependent 
only  on  the  total  rate  of  flow  of  output  of  the  two  firms.   Monetary  units  are  such  that 
P(l)=l>e=P(2)>0.   (For  completeness,  let  P(0)  be  any  finite  number.)  If  e<l/2  then  in- 
dustry flow  revenue  falls  as  the  rate  of  production  increases,  so  we  say  that  demand  is 
inelastic.   If  e>l/2,  then  we  will  refer  to  demand  as  being   elastic.   There  is  no  other  cost  of 
production  beyond  the  opportunity  cost  implied  by  the  depletion  of  capacity.   Future  re- 
ceipts are  discounted  at  the  common,  uniform  rate  r>0.   The  firms'  payoffs  are  simply 
their  respective  discounted  flows  of  revenue  received  over  the  infinite  horizon  as  a  result  of 
their  joint  paths  of  supply. 

We  study  the  equibbria  of  the  following  discrete  time  analogue  of  the  continuous 
time  set— up  introduced  above:   Assume  that  the  firms  cannot  change  their  flow  rate  of 
output  except  at  particular,  exogenously  given  and  uniformly  spaced  moments  in  time  — 
that  is,  except  at  the  beginning  of  a  "period."   After  any  change  they  are  bound  to  main- 
tain their  production  at  the  new  rate  until  the  next  such  moment.   These  moments  when 


adjustments  are  permitted  are  common  knowledge  to  the  two  firms.   Let   h>0  be  the 
length  of  a  "period"  —  the  interval  between  dates  when  a  change  of  action  is  allowed.   A 
discrete  time  dynamic  game  is  defined  for  each  value  of  h,  whose  equilibria  we  character- 
ize.  We  also  study  the  behavior  which  emerges  in  these  equilibria  in  the  limit,  as  h-»0. 

Suppose  then  that   T(h)E{0,  h,  2h,  3h,...}   is  the  set  of  dates  when  new  actions  can 
be  taken.   Associate  with  each   reT(h)  the  integer  t=r/h.   We  will  often  refer  to  "date 
t",  meaning  the  instant   r=h-t,  and  to  "period  t",  meaning  the  interval   [h-t,  h-(t+l)). 
Define  the  discount  factor  8(h)se       ;  a  dollar  received  on  date  t  is  worth   6(h)    dollars  at 
date  zero.   Monetary  flows  over  intervals   [r,r+h),  reT(li),  are  constant  given  our  assump- 
tions, and  may  be  regarded  as  equivalent  lump  sums  paid  at  the  beginning  of  each  period. 
Suppose  that  at  dates  tel  the  firms  take  the  actions   qf.   Then  their  respective  payoffs  are 
V»    and  VR  where: 


(2.1)  VA=../3Sto.P(Q)];     and    V     =  0  S  M-P{Q  )], 

a      tGl      t        t  c      tel      l     '  l 


!  „-rh 
for  /?(h)=[ ],  the  lump  sum  equivalent  of  the  flow  of  one  dollar  over  an  interval  of 

length  h.   Unique  payoffs  are  implied  by  every  path  of  firms'  actions  via  (2.1). 

Given  periods  of  length  h,  we  consider  now  how  the  firms'  productive  capacities 

evolve  over  the  course  of  the  game.   Let  I  be  the  set  of  positive  integers,  and  IeIu{0}. 

Define  the  initial  state,  s  (h,R)=(a  (h,RAb  (h,RR)),  as  follows:   a  =max{aeI|a-h<RA}, 

and  b  =max{beI|b-h<RR}.   (We  suppress  explicit  dependence  of  s     on  h  and  R  when 

that  is  inessential.)   The  initial  state  is  a  pair  of  integers  denoting  the  "number  of  plays" 

which  each  firm  can  make  over  the  course  of  the  game,  where  a  "play"  involves  supplying 

output  at  the  constant  unitary  rate  for  a  period  of  duration  h,  and  where  we  "round— off" 

by  neglecting  the  fact  that  the  initial  stocks  may  not  be  evenly  divisible  by  h.   Let   SeI«I 

2 

denote  the  state  space.  In  this  way,  for  every  initial  vector  of  capacities  ReR      and  every 


h>0,  we  assign  an  initial  state  s  eS,  representing  this  association  by:   s  (h,R)~h—  -R.   It 
is  apparent  that  if  s.    is  the  state  of  the  game  at  date  t,  and  if  q.    is  the  vector  of  firms' 
actions  during  period  t,  then  the  state  al  date  t+1  satisfies:   s.  ,  ,  =  s,  —  q, . 

IQ.   Markov  Perfect  Equilibrium 

It  is  obvious  that,  for  any  given  history  of  play  prior  to  date  t,  the  set  of  feasible 
action  paths  subsequent  to  date  t,  and  the  firms'  conditional  valuations  of  these  paths, 
depend  on  that  history  only  through  the  state  of  the  game  at  t,   s  =(a,,b, ).   That  is,  the 
"state  of  the  game",  as  defined  here,  is  a  "payoff  relevant  state"  in  the  sense,  e.g.,  of 
Maskin  and  Tirole  (1989):   Conditional  valuations  are  independent  of  the  history,  given  the 
stationarity  assumed  in  (2.1).   Feasibility  depends  only  on  whether  remaining  stocks  are 
big  enough;  all  histories  prior  to  t   which  reach  the  state  s,    imply  the  same  set  of  feasible 
subsequent  actions. 

We  employ  the  concept  of  Markov  Perfect  Equilibrium  (MPE)  for  the  discrete 
game,  with  h  fixed,  in  the  analysis  to  follow.   In  general  then,  strategies  are  functions 
from   S  into   [0,1],  reporting  for  each  Cim  the  probability  of  participation  in  the  market 
during  a  period  when  the  state  is  seS.   However,  we  shall  restrict  attention  to  pure  stra- 
tegy MPE.   This  restriction  involves  less  loss  of  generality  than  may  appear  to  be  the  case, 
since  pure  strategy  equilibria  always  exist  in  this  model,  and  mixed  strategies  are  ineffi- 
cient here.  If,  at  some  node  s,   A  and    B   play  "0"  with  positive  probability,  they  risk 
getting  "stuck"  there  for  a  period  —  a  source  of  lost  industry  profits  which  does  not  exist 
with  pure  strategies.   Because  we  are  primarily  concerned  with  understanding  when  a  rev- 
enue maximizing  outcome  occurs  in  equilibrium,  we  restrict  attention  to  pure  strategies. 

Given  a  pair  of  pure  Markov  strategies  on  S,  the  resulting  motion  through  the  lat- 


tice during  the  play  of  the  game  may  be  envisioned  as  exemplified  by  Figure  1.   At  each 

2 
point   seS  four  continuations  are  possible,  corresponding  to  the  four  elements  qe{0,l}  . 

These  outcomes  result  in  motion  of  the  state  from  s  to  some  successor  node,   s'=s— q.  If, 


at  node  s,  both  firms  stay  out  of  the  market   (i.e.,  q=(0,0)),  then  s'=s,  and  the  state 
remains  where  it  began.   With  Markov  strategies  this  amounts  to  remaining  at   s  forever, 
an  obvious  impossibility  in  an  equilibrium  unless  s=(0,0).   The  three  other  continuations 
correspond  to  one  or  both  firms  being  in  the  market,  and  result  in  motion  of  the  state 
either  "due  South",  "due  West",  or  "Southwest",  as  depicted  by  the  arrows  in  the  Figure. 
Eventually  the  state  reaches  one  or  both  of  the  axes,  representing  the  exit  of  one  or  both 
firms.   At  this  point,  the  firm  with  remaining  capacity  produces  one  unit  in  each  period 
until  the  state  reaches  the  origin. 


Figure  1 
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Thus,  with  each  profile  of  pure  Markov  strategies  and  each  initial  state  may  be 
associated  a  unique  path  through  the  lattice  and,  via  (2.1),  unique  payoffs  for  the  firms. 
Given  the  period  length,  these  payoffs  are  a  function  of  the  initial  node,  the  strategy  pair, 
and  the  parameters   e  and  6.  Every  path  terminates  in  a  finite  number  of  steps.   Current 
period  payoffs  (divided  by  /3(h))  at  each  node  depend  on  the  actions  taken  there  as  follows: 


B 

1  0 


1 
0 

0 
1 

0 
0 

Table  1 


We  now  precede  formally,  introducing  some  notation  and  definitions.   Define: 
EAE{<rA:S-*{0,l}|CTA(0,b)=0,VbeI},   EBe{<7b:S^{0,1}  |  aB(a,0)=0,VaeI},  and   £eEA*EB 


A  pure  strategy  profile  (or  simply,  a  profile)  is  an  element    cre£.    From  any  state   seS,  every 
profile   creE  generates  a  path  {s  ,s,,...,s.,...  },  defined  inductively  as  follows: 

(3.1)  sQ  =  s,   and 

st  =  st-l  ~  ^st-l)'    tGl 

We  denote  this  sequence  {s,(s,<7)}.   We  say  that  point   s'    is  a  successor  of  s   under  a  if 
s'e{s,(s,i7)},  and  that   s'    is  a  predecessor  of  s   under  a  if  se{s,(s', cr)}.   Identifying   q, 
in  (2.1)  with   cr(s.(s,<7)),  we  may  define  the  value  function  for  each  firm,   V  .  (<r)(s)  and 
VR(cr)(s),  at  state  seS  given  profile   <reX,  in  the  obvious  way,  as  the  respective  sum  in 
(2.1)  along  the  path   {s,(s,o-)}. 

Definition  3.1:   A  strategy  profile   ctgS  is  a  Nash  Equilibrium  from  sGS  if: 

VA(flr)(s)  >  VA(<rA,&B)(s)    VaAeSA,  and 
VB(a)(s)  >  VB(aA,cB)(s)    VatfZ8. 

Definition  3.2:   A  strategy  profile   o*gS  is  a  Markov  Perfect  Equilibrium  if  it  is  a  Nash 
Equilibrium  from  s,  for  every  SGS. 

Let  S  cS  be  the  set  of  all  MPE's  in  pure  strategies.   Nash  Equilibrium  is  weaker 
than  MPE  since  it  requires  strategies  to  be  best  responses  to  each  other  only  from  a  partic- 
ular initial  state.   In  a  NE  neither  firm  gains  by  deviating  from  its  strategy  at  any  node 
along  the  subsequent  path,  given  that  it  anticipates  the  response  to  the  deviation  to  be  as 
reported  by  the  strategy  of  the  other  firm    Yet  these  responses  are  not  themselves  con- 
strained, beyond  feasibility,  by  the  definition.  Therefore,  NE  may  depend  on  threats  which 
are  not  credible.   MPE  requires  that  promised  responses  to  deviations  must  be  consistent 
with  subsequent  equilibrium  play. 


We  now  formalize  the  idea  that  a  profile   a  is  an  MPE  on  some  subset  of  the  state 
space.   In  the  sequel  we  construct  equilibna  by  finding  particular  strategies  that  are  best 
responses  to  each  other  from  initial  states  in  a  given  region  of  S,  extending  the  resulting 
profile  to  the  entire  state  space  so  as  to  constitute  a  full  equilibrium.  We  say  that  a  set  of 
states  W  is  closed  to  deviations  from  a  profile  a  if  all  paths  from  seW  generated  by   a 
stay  in  W,  and  if  one  firm's  deviation  frcm  such  a  path  also  yields  a  point  in  W.   Denote 
by   a\a-  the  profile  obtained  by  replacing  j's  strategy  under   a  with   a-,  j=A,B. 

2 
Definition  3.3:    Given  a  set  of  states  WcS   and  a  function   a:W  -»  {0,1}   ,  we  say  that   W 

is  closed  to  deviations  from  a  if  the  following  condition  holds: 

VseW,  VaA6EA,  VcrBeEB:   {st(s,a\aA)}cW,  and  {st(s,a\aB)}cW. 

Definition  3.4:   A  strategy  profile   aeE  is  a   restricted  MPE  on  W  if: 

(a)  W  is  closed  to  deviations  from   a;  and 

(b)  g  is  a  NE  from  s,  VseW. 

Let  S|WcS  denote  the  class  of  restricted  MPE  on  W.   Obviously,  if  aeE,w,  then 
cr'eE  and  a'  sa  on  W  implies   cr'  eSi  ,v.   Restricted  equilibria  on  a  given  set  may  thus  be 
thought  of  as  equivalence  classes  of  strategy  profiles,  in  which  membership  is  defined  by 
agreement  on  the  set  in  question.   It  is  also  obvious  that  if  aeE     then   aeE.  w  if  and  only 
if  W  is  closed  to  deviations  from   a.   Theorem  1  establishes  that  every  restricted  MPE 
can  be  extended  to  the  entire  state  space  in  a  manner  consistent  with  Markov  Perfection. 

Consider  the  following  simple  example  which  illustrates  the  definitions.   For  nel 
define  S  n{s=(a,b)eS|  a+b<n}.   Obviously,   S     is  closed  to  deviations  from  any   creE  (as 
is  any  set  containing  the  origin,  and  containing  all  points  "southwest"  of  any  point  it 
contains.)   Now,  consider  the  strategy  profile   o=(a.,(7R)   defined  as  follows:   B   plays 
whenever  he  has  a  positive  stock.   A  waits  until  B  has  exhausted  before  playing  any- 


10 


Tl  —         * 

thing.   Let   n'=Max{nGl \6  >e}.   Then,  if  n'>l,  we  have  that   ae£,        for  2<n<n'+l:   B 

kn 


n 


cannot  gain  from  deviating;  and   A  will  deviate  only  if  e>6  .   But  it  is  easy  to  see  that 


* 


a£E  .   For  max{a,b}  large  enough,   A   will  deviate. 

To  establish  that  a  profile   a  ccnstitutes  an  MPE  it  suffices  to  show  that,  for  every 
seS,  neither  firm  can  increase  his  payoff  by  a  single  deviation  from   a  at  some  point  along 
the  path   {s,(s,<7)},  given  that   a  is  followed  by  both  firms  after  the  deviation.   That  is, 
only  "one— shot"  deviations  need  be  considered.   This  follows  from  the  obvious  fact  that,  if 
a  sequence  of  deviations  is  desirable,  it  must  contain  a  "one— shot"  deviation  at  some  node 
which  raises  the  firm's  payoff.   Moreover,  since  no  firm  will  deviate  at  a  node  where  he 
alone  produces,  only  two  forms  of  deviation  are  relevant:   Either   ct(s)=(1,1)   and  someone 
prefers  to  stay  out  of  the  market,  or   o-(s)e{(l,0),(0,l)}   and  the  firm  not  producing  prefers 
to  be  active. 

At  the  node  s,  for  the  firm  A,  and  given  the  putative  equilibrium  profile   a,  to 
determine  if  a  profitable  deviation  is  possible  we  need  to  compare  two  quantities:   (i) 
Pe+SVj.  (a)(s'),  the  current  revenue  flow  when  q=(l,l)   plus  the  discounted  continuation 
value  of  moving  from  s  to  s'=s— (1,1);  and  (ii)   6V  Ja)(s' '),  the  discounted  continuation 
value  occasioned  by  the  actions  q=(0,l)  which  lead  to  the  node  s"=s— (0,1).   If 
<r(s)=(l,0),   A  never  deviates  from   a  at  s.   If  quantity  (i)  exceeds  (ii)  and   <r(s)=(0,l), 
or  if  (ii)  exceeds  (i)  and   a(s)=(l,l),  then  A  deviates.   The  analogous  conditions  charact- 
erize the  desirability  of  deviations  for  firm  B. 

If  a  profile   a  provides  no  incentive  for  either  firm  to  deviate  from  its  specified 
actions  at  a  given  node,  assuming  it  is  followed  thereafter,  then  the  firms'  value  functions 
Vi(u)(s)  and  V-R(cr)(s)  must  satisfy  standard  dynamic  programming  conditions.   Thus: 

Definition  3.5:   Given  strictly  positive  s?S,  we  say  that  a  strategy  profile   aeS  satisfies  the 
Bellman  Inequalities  at   s  if  cr(s)^(0,0),  and: 
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(3.2)  (a)   o(s)=(l,l)  *  VA(a)(s)  =  (3e+6V A(a)(s-(l,l))  >  ^(^0,1)), 

and  VB(a)(s)  =  0e+«VB(a)(s-{l,l))  >  SVB(a)(s-{l,0)). 

(b)  a(s)=(0,l)=>  /36+iVA(a)(s-(l>l))<«VA(a)(s-(0,l))  =  VA(a)(s). 

(c)  o(b)=(1,0)=»  /3c+*VB(a)(s-(l>l))<«VB(ff)(s-{l>0))  =  VB(a)(s). 

We  summarize  the  foregoing  definitions  and  discussion  with  the  following  useful  result: 

Lemma  1:   Given  WcS  let  the  profile   aeS  satisfy  the  following  conditions  on  W: 

(a)  W  is  closed  to  deviations  from   a; 

(b)  <7A(a,0)=crB(0,b)=l,  at  any  non— zero  points  (a,0)  and  (0,b)e  W; 

(c)  a  satisfies  the  Bellman  Inequalities  at  every  strictly  positive  se W; 
Then   cr£S|W.   If  a  satisfies  (b)  and  (c)  on  all  of  S,  then   ctgS  . 

It  is  possible,  using  a  two— dimensional  backward  induction,  to  characterize  all 
Markov  Perfect  Equilibria  in  pure  strategies  for  this  game.   This  may  be  seen  as  follows: 
For  arbitrary  values  of  V.  and  VR  at  the  nodes  s— (1,1),  s— (1,0)  and  s— (0,1),  there  is 
always  some  pair  of  actions  q=cr(s)  which  permit  the  Bellman  Inequalities  to  be  satisfied 
at  s.   That  is,  for  s  strictly  positive  and   ceE,  either  both  inequalities  in  (3.2)(a)  hold  or 
one  of  them  fails,  in  which  case  one  of  the  inequalities  in  (3.2)(b)  or  (3.2)(c)  holds.   More- 
over, VA(o-)(n,0)=VB(a)(0,n)=/?(l-^n)/(l-^)  for  all   a  satisfying  (b)  of  Lemma  1.   There- 
fore, starting  at  the  point   (1,1)   and  working  systematically  "backward",  up  and  to  the 
right  in  the  lattice,  one  constructs  a  pure  strategy  MPE  by  successively  assigning  values  to 
a  at   s,  given  the  values  already  assigned  at  to   a  at   s'<s,  in  such  a  way  that  the 
Bellman  Inequalities  are  satisfied  at  each  step.   It  is  possible  to  do  this  so  that  at  each  node 
s,  values  for  the  functions  V.(a)(-)  and  VR(c-)(-)  have  already  been  determined  at  the 
nodes  s — (1,1),  s— (1,0)  and  s— (0,1).   Employing  this  induction,  we  prove  that  a  restricted 
MPE  on  a  subset  W  of  the  state  space  can  always  be  extended  to  the  entire  lattice  in  such 
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a  way  that  the  resulting  profile  constitutes  a  full  equilibrium  on   S. 


Theorem  1:   Given  WcS  and   ctg£|W)  there  exists   a*eS     such  that:    o*(s)=a(s),  VseW. 
Proof:   See  the  apppendix. 


Theorem  1  is  an  important  tool  which  shall  be  used  repeatedly  in  the  sequel.   We 
turn  now  to  a  substantive  analysis  of  MFE  strategy  profiles.   The  argument  is  constructive; 
we  solve  the  model  explicitly  to  find  sta.te— contingent  plans  of  action  which  are  mutually 
consistent  with  revenue  maximization  by  both  firms  from  every  conceivable  state.   We 
begin  by  defining  tacit  collusion  and  describing  open— loop  equilibrium  in  the  model. 

IV.   On  the  Possibility  of  Collusion  in  Markov  Perfect  Equilibria 
A.   Tacit  Collusion 

In  this  simple  setting,  with  only  t  *ro  possible  actions,  the  notion  of  tacit  collusion 
takes  a  very  primitive  form.   In  order  to  maximize  the  sum  of  their  payoffs  the  firms  must 
arrange  to  avoid  supplying  at  the  same  time  if  that  reduces  discounted  industry  revenues. 
It  is  obvious  that  the  industry  revenue  maximizing  output  at  any  strictly  positive  node 
s=(a,b)  depends  only  on  the  total  (a+b).   Let   Q*(s)e{0,l,2}    be  this  optimal  collusive 
behavior  at  seS.   Regarding  tacit  collusion  (or  coordination,  or  cooperation),  we  say  that 
the  MPE   ueS     achieves  collusion  at  s  if  cr»(s)+(7o(s)=Q*(s).   The  region  of  collusion 
under   a,  M(a),  is  the  set  of  nodes  at  which   a  achieves  collusion.   If  a  path  satisfies 
{s,(s,a)}cM(a),  seS  and  ce£  ,  we  call  it  a  collusive  path.   Industry  revenues  are  maxim- 
ized in  the  MPE   a  from  the  initial  node  s     iff  a  generates  a  collusive  path  from  s  . 

With  inelastic  demand  it  is  obvious  that  Q*(s)h1,  s^(0,0),  since  supplying  two  units 
instead  of  one  lowers  both  current  and  future  revenue.  With  elastic  demand,  if  s=(a,b) 
and  a+b  is  "large",  then   Q*(s)=2:   Because  of  discounting,  the  opportunity  cost  of  sup- 
plying another  unit  today  vanishes  as  total  capacity  grows  large,  while  the  marginal  rev- 


13 


enue  of  two  instead  of  one  unit  of  output  is  /?(2e— 1)>0.   So  if  e>l/2   then,  for  some  n>l, 
Q*(a,b)=l,  a+b<n;  and   Q*(a,b)=2,  a-i-b>n.   The  critical  n  is  given  as  follows: 


(4.1)  n"  =  Max{neT|  ^j    >  2e+f(^y-  )}  =  Max{neT|  6n   l  >  2e-l}. 


That  is,  n  is  the  largest  total  stock  at  which  the  opportunity  cost  of  an  incremental  unit 
of  production,  given  that  one  unit  is  to  be  supplied  in  each  subsequent  period,  is  no  less 
than  the  marginal  revenue  of  raising  output  from  one  to  two  units.   With  inelastic  demand, 
collusion  in  equilibrium  requires  the  avoidance  of  simultaneous  production  everywhere. 
With  elastic  demand,  such  coordination  is  only  relevant  when  the  total  stock  is  less  than  n. 

B.   Collusion  in  Open  Loop  Equilibrium 

In  order  to  contrast  the  possibilities  for  collusion  in  Markov  Perfect  Equilibria  with 
the  outcomes  achieved  in  the  open-loop  (precommitment)  equilibria  of  the  same  game,  we 
consider  the  latter  briefly.   By  "open— loop"  play  we  mean  that  firms  can  bind  themselves 
at  the  initial  date  to  time  paths  of  actions  which  are  known  to  both.  If  one  firm  believes 
the  other  is  so  committed,  then  it  has  a  corresponding  path  which  maximizes  its  payoff. 
With  paths  constrained  to  be  compatible  with  the  available  capacities  of  the  firms,  we  con- 
sider the  Nash  Equilibria  in  these  paths.   Such  an  equilibrium  achieves  tacit  collusion  from 
the  node  (a,b)  if  it  avoids  simultaneous  production  when  to  do  so  is  necessary  to  maxim- 
ize the  sum  of  the  firms'  payoffs.   The  following  result  characterizes  the  region  in  S  where 
tacit  collusion  may  be  achieved  using  open-loop  strategies. 

Lemma  2:   There  exists  an  open— loop  Nash  Equilibrium  from  the  strictly  positive  node 
(a,b)eS  which  avoids  simultaneous  production  everywhere  along  its  path  if  and  only  if: 
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(4.2)  jMax{a,b}  +  Min{a,b}  <  n'  e  Max{nel|  6n>_e}. 


Proof:   In  an  open— loop  Nash  Equilibrium  neither  firm  wishes  to  shifts  a  unit  of  output 
from  its  last  date  of  positive  supply  to  its  first  date  of  zero  supply.   Letting  the  difference 
in  these  two  dates  for  either  firm  be   0,  such  a  shift  pays  if  e>6  .   So  an  open— loop  equi- 
librium  without  simultaneous  production  is  impossible  if  e>6    for  some  firm,  along  every 
path  where  one  or  the  other,  but  not  both  of  them,  supplies  on  successive  dates.   Consider 
the  problem  of  finding  a  sequence  of  a   "i's"   and  b   "O's"   such  that  the  larger  of:   (1)  the 
difference  in  order  between  the  last  "1"  and  the  first  "0";  and  (2)  the  difference  in  order 
between  the  last  "0"  and  the  first  "1";  is  n.inimal.   Denote  this  minimal  difference  by 
0(a,b).  Thinking  of  this  sequence  as  the  tLne  path  of  supply  for  firm  A  in  an  open-loop 
equilibrium  from  initial  node  (a,b)   which  avoids  simultaneous  production,  we  see  that 
neither  firm  will  deviate  if  0(a,b)<n'.   The  reader  can  verify  that  the  minimal  difference 
0(a,b)  is  realized  by  placing  the  lesser  in  number  of  the  two  digits  in  the  middle  of  the 
sequence  and  surrounding  them  as  evenly  as  possible  on  either  side  by  the  other  digit. 

Assuming  that  a>b,  and  that  a  is  even,  this  is  illustrated  in  Figure  2: 

|< 0 >| 

1,   1,   l,...l,  0,   C,   0,...0,  1,   1,   1....1 
a/2  b  a/2 


Figure  2 

Examination  of  this  Figure  shows  that  this  disposition  cannot  be  improved  upon,  and  that 
0(a,b)  =  Min{nel|n  >  ^Max(a,b)+Min(a,b)}.  d 

With  open— loop  strategies  each  firm  takes  the  other's  path  as  given,  and  so  anti- 
cipates no  consequence  of  an  increase  in  cuvrent  supply  besides  a  reduction  of  its  own  cap- 
acity for  future  production.   So  the  "shadow  price"  on  a  marginal  unit  of  capacity  is  the 
present  value  of  revenue  from  the  last  unit  sold.   There  exist  endowments  from  which  each 
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firms'  initial  shadow  price  on  marginal  capacity  is  less  than    e,  its  marginal  revenue  at  zero 
output,  along  every  collusive  path.   From  such  endowments,  for  which  (4.2)  fails,  both 
firms  produce  on  the  initial  date  in  every  open— loop  equilibrium.   (Note:   2n'<n.)   So, 
whatever  the  elasticity  of  demand,  total  size  and  relative  symmetry  of  capacities  can 
militate  against  achieving  maximal  industry  revenues  in  an  open— loop  equilibrium. 

C.   Precedence  and  Alternation  with  Markov  Strategies 

We  will  show  that  something  close  to  the  opposite  of  the  foregoing  is  true  regarding 
collusion  in  Markov  Perfect  Equilibrium.   Our  method  is  constructive;  we  shall  exhibit  a 
particular  equilibrium  profile  to  establish  this  point.  Intuitively,  closed-loop  strategies  are 
more  effective  at  achieving  coordination  because,  by  making  the  entire  future  course  of  the 
game  contingent  upon  current  actions,  they  can  impose  on  the  firms  a  cost  of  increased  out- 
put far  greater  than  the  foregone  discounted  revenue  from  the  last  unit  sold.  We  demon- 
strate formally  in  Proposition  1  that  when  total  production  capacity  is  arbitrarily  large, 
implying  a  vanishingly  small  shadow  price  for  the  marginal  unit,  if  capacities  are  relatively 
symmetric  and  the  periods  short,  the  firms  can  always  coordinate  their  actions  tacitly  so  as 
to  maximize  discounted  industry  revenue  in  a  non-cooperative,  perfect  equilibrium. 

A  key  concept  in  this  construction  is  the  notion  of  "turn— taking".   With   {0,1} 
actions  and  Markov  strategies,  colluding  to  maintain  a  high  price  amounts  to  specifying 
regions  of  the  state  space  within  which  one  firm  has  precedence,  and  the  other  is  expected 
to  forebear.   Along  a  collusive  path,  one  firm  produces  until  a  node  is  reached  which  signals 
the  other's  "turn",  the  other  then  produces  until  the  state  passes  out  of  its  region  of 
precedence,  and  so  on.   In  equilibrium  the  firm  without  the  turn  must  prefer  forebearance 
to  "playing  out  of  turn." 

A  special  case  of  this  turn-taking  partition  might  be  called  "pure  alternation," 
where  the  firms  share  the  market  by  producing  on  alternate  dates.   To  motivate  the 
alternating  strategy  profile,  consider  a  one— shot  game  with  payoffs  as  given  in  Table  1. 
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This  game  has  (1,1)   as  its  strictly  dominant  Nash  Equilibrium,  providing  each  firm  with  a 
return  of   e.    Suppose  that  the  firms  were  to  encounter  this  game  over  an  infinite  sequence 
of  periods,  with   6  being  their  common  discount  factor.   For  small   e  the  one-shot  equi- 
librium would  imply  substantial  losses.   Both  firms  would  be  much  better-off  if  they  could 
coordinate  their  actions  so  as  to  avoid  "head  to  head"  competition,  yet  each  prefers  to 
supply  in  the  current  period.   Resolution  of  this  Prisoner's  Dilemma— like  problem  requires 
that  the  firms  reach  some  tacit  agreement  as  to  who  will  supply,  and  when. 

A  natural  understanding  which  might  develop  between  them  in  such  a  situation  is: 
"Whoever  produced  last  period  stays  out  this  period".   If  the  discount  factor  is  not  too  low 
it  will  be  better  to  wait  until  tomorrow  for  one  unit  of  revenue,  and  the  implied  oppor- 
tunity to  wait  yet  again  the  day  after  tomorrow,  than  to  claim   e  units  of  revenue  every 
day.   It  is  easy  to  see  that  such  an  alternating  convention,  supported  by  the  threatened 
punishment  of  reverting  to  "(1,1)  forever1'  if  deviation  occurs,  is  a  subgame  perfect  equi- 
librium of  the  infinitely  repeated  game  if  and  only  if  "waiting  to  alternate"  is  better  than 
"competing  forever".   That  is,  if  and  only  if:   5-(l+62+^+...)  =  S/(l-62)  >  e/{l-8).   Or, 


(43)  «  <-  lh- 


This  condition  is  important  in  the  analysis  of  Markov  Equilibrium  to  follow.   Note  that  it 
holds  with  inelastic  demand  if  the  periods  are  short  enough,  but  never  with  elastic  demand. 

Return  now  to  the  dynamic  game  with  cumulative  capacity  constraints  which  we 
have  been  considering.   Formally,  define  a  profile  <reE  as  an  alternating  strategy  profile  if, 
for  all  strictly  positive  seS:   a(s)+cr(s— u(s))=(l,l).   The  path   {s,(s,a)}   therefore  involves 
the  firms  supplying  the  market  alternately  in  successive  periods  for  as  long  as  both  have 
positive  stocks.   There  are  exactly  two  such  profiles,  distinguished  by  which  of  the  firms 
supplies  when  the  stocks  of  both  are  positive  and  their  sum  is  an  even  integer.  We  assume 
without  loss  of  generality  that  firm   A  produces  under  the  alternating  strategy  profile 
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when  the  sum  of  positive  stocks  is  even,  and  denote  the  corresponding  profile  by    aeE. 
Notice  how  the  reduction  of  the  continuous  game  to  one  in  discrete  time  with  a  countable 
state  space,  induced  by  the  assumption  that  actions  cannot  be  instantly  changed,  is  being 
heavily  exploited  in  defining  this  form  of  precedence. 


Figure  3 
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Take  an  arbitrary  point   scS,  and  consider  the  set  of  all  successor  nodes  to  s  under 
a,  which  we  call  the   alternating  path  from  s,  denoted   {s,(s, cr)}.   These  paths  have  a  stair- 
step appearance,  and  include  nodes  along  the  axes  leading  to  (0,0).   Figure  3  illustrates 
several  alternating  paths.   Notice  that  these  paths,  regarded  as  subsets  of  S,  are  closed  to 
deviations  from   a,  VseS.   A  deviation  from   a  means  either  remaining  at  s,  or  "jumping 
ahead"  to  s— (1,1).   By  definition  of  a,   s— (l,l)=[s— a(s)— <r(s— a(s))]=s2(s,o-)e{s,(s,cr)},  for 
s>>(0,0),  so  a  deviation  from   a  always  leads  to  a  node  belonging  to  its  alternating  path 
through  s.   (Indeed,  only  an  alternating  strategy  profile  generates  paths  which  are  closed 
to  deviation  from  itself.)   A  useful  way  to  state  this  fact  is  that,  along  any  alternating 
path,  the  consequence  of  deviation  from  "waiting  one's  turn"  is  always  to  "lose  one's  turn." 

It  is  easy  to  see  that  the  turn— taking  convention  represented  by   a  is  not  an  MPE. 
For  if  <r<e  it  must  be  that   aA(a,l)=l,  VcreS  :   By  playing  "1"  at   (a,l)   A  guarantees 
itself  a  payoff  of  at  least  /3[e+£(l-<Sa-1)/(l-<5)]  which  exceeds  /36{l-^)/{l-S),  the  best  it 
can  get  from  playing  "0."   Since   5rA(a,l)=0  when  a  is  even,  it  follows  that   a£E  . 
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Nevertheless,  there  is  some  non-trivial  subset  of  nodes,  implicitly  defined  as  follows,  from 
which  the  alternating  profile   a  is  a  restricted  MPE: 

Definition  4.1:   For   a  as  above  and  0<f<l   define  the  region  of  alternation,  3(e), by: 

3(e)  =  {s6S|Vs'6{st(s,a)},  if  ap')=Q,  then  Vfa)(a')<6-0§=$,  j=A,B}. 

Theorem  2:   aeE,  g,i  y   Moreover,  if  ^c£|W  then  WcJ(t).   That  is,  3(e)  is  the  larg- 
est subset  of  S   on  which  a  mutual  agreement  to  "take  turns"  could  be  self— enforcing. 

Proof:    3(e)  is  the  countable  union  of  alternating  paths,  each  of  which  is  closed  to  devi- 
ations from   h.   Therefore    3(e)  is  closed  to  deviations  from   a.   The  Theorem  then  follows 
from  Lemma  1,  if  we  can  show  that  Bellman  Inequalities  (3.2)(b  or  c)  are  satisfied  at  every 
strictly  positive  node  s£.3(e).   If  firm   j   "waits  its  turn"  at  s  its  payoff  there  is  V.(a)(s). 
If  it  deviates  by  playing  "out  of  turn"  it  receives  fie  immediately,  but  loses  the  oppor- 
tunity to  have  the  market  to  itself  nexl  period.   This  continuation  yields  the  payoff 
(3e+8V -(o)(s— (1,1))-   But,  because  it  is  waiting  to  have  the  market  to  itself,  it  must  be  that 
V(b-)(s)=6-[0+5V.(a)(s-(l,l))].  So  the  deviation  pays  iff  V.(3-)(s)><5/?(j^).  Thus, 
3(e)  is,  by  definition,  precisely  the  set  of  nodes  where  deviation  from   a  does  not  pay.      d 

This  result  leads  to  the  following  general  rule  about  turn— taking:   A  firm  "waits  its 
turn"  for  one  period  if  the  value  of  that  turn,  when  it  does  come,  does  not  exceed  the  value 
of  receiving,  in  perpetuity,  the  revenue  loss  to  the  other  firm  caused  by  it  having  played 
out  of  turn.   Moreover,  the  argument  for  this  general  rule  only  used  the  alternating  prop- 
erty of  a  "locally,"  at  a  node  s   and  its  immediate  successor.   So  we  have  the  following 
partial  converse  of  Theorem  2,  relevant  whenever  precedence  changes  hands. 
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Corollary  1:   Suppose  aeE  .   Then   a(s)  +  cr(s— cr(s))  =  (1,1),  and    a-(s)=0   imply  that 
V.(a)(s)a/?(|^),  for  je{A,B}   and  seS. 


The  next  result  provides  an  explicit  description  of  .0(e).   We  also  introduce  nota- 
tion which  will  be  used  extensively  in  the  sequel.   Recall  that  n'EMax{n6l|  Sn>e}. 

Definition  4.2:  For  s=(a,b)eS  define  n(s)E|a— b|,  and  k(s)=min{a,b}.  Let  j(s)  denote 
the  larger  firm  and  i(s)  the  smaller  firm  at  s,  i,je{A,B}.  (Explicit  dependence  of  these 
functions  on  s  will  often  be  suppressed.)   Define  the  function  (}>(n,k)   as  follows: 


(4.4)  <|>(n,k)  e  62k~2-  <5n  +  (l-^"2).^),  for  nel  and  kel. 


Corollary  2:    0(e)={seS  [Vkel,  k<k(s):   <|>(n(s)+l,k)>e  if  a-, s\(s')=0,  and  <j)(n(s),k)>e  if 
aw  Js)=0}.   Moreover,  when   (4.3)  holds  then    0(e)={seS|Vt>O,  n(st(s,a))<n'}. 
Proof:   See  the  appendix. 

With  indivisible  production  forebearance,  by  one  firm  or  the  other,  is  essential  if 
they  are  to  share  the  market  and  maintain  a  high  price.   Theorem  2  and  Corollary  1  imply 
that  such  restraint  is  consistent  with  the  firms'  interests  only  if  the  "contingent  right  to 
produce"  has  been  allocated  in  such  a  way  that  payoffs  never  exceed  f3{jzr)-  Any  greater 
return  is  too  large  for  a  firm  to  be  willing  to  "wait  its  turn"  to  receive  it.   Now  the  region 
0(e)  consists  of  nodes  from  which  this  criterion  is  satisfied  at  every  point  along  the  path 
generated  by   a.   Thus,  if  the  initial  node  s  €  0(e)  we  say  that  alternation  is  implement- 
able  (in  MPE),  meaning  that  a  tacit  agreement  to  share  the  market  through  simple  "turn- 
taking"  could  be  implemented  by  a  state— contingent,  self-enforcing  rule.   Corollary  2  shows 
how  the  implementability  of  this  norm  depends  on  the  size  and  distribution  of  capacitities. 

If  demand  is  sufficiently  inelastic,  in  that  (4.3)  holds,  then  0(e)  contains  any  node 
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from  which  alternation  never  causes  the  stocks  to  differ  by  more  than   n' ,  without  regard 
to  total  capacity.    So,  for  £<yr~T,  impleir.entability  depends  only  on  symmetry.    The  higher 
the  interest  rate  or  longer  the  period  length,  the  less  elastic  demand  must  be  for  this  result 
to  hold.   With  elastic  demand  (4.4)  implies  a  trade— off  between  the  aggregate  size  of  cap- 
acity and  the  inequality  of  its  distribution,  in  determining  whether  alternation  is  imple- 
mentable.   For   e>T^-c,  <j)(n,k)>e  only  if  6n>e  (i.e.,  n<n')   and  k  is  not  too  large;  the 
smaller  is  n,  the  larger  can  k  be  while  maintaining  (j)(n,k)>e.   Thus,  the  greater  is  the 
total  capacity,  the  more  equally  must  it  be  held  for  alternation  to  be  implementable. 

Another  implication  of  Theorem  2  is  the  Proposition  below.   Its  proof  examines  the 
limit  as  h-»0  of  the  set  h-  3>{t),  and  uses  Theorem  1  to  extend   a  to  a  full  MPE  on  S. 
This  method  foreshadows  limit  arguments  developed  more  fully  in  section  V  of  the  paper. 

Proposition  1:  For  any  fixed   £6(0,1),  r>0,  and  total  capacity  R~eR.+Rr,  an  MPE  exists 
which  achieves  collusion  whenever  it  is  relevant,  given  that  the  periods  are  short  enough 
and  the  initial  holdings  are  sufficiently  symmetric.    Formally: 

Let  R~,  e,  and  r  be  given,  and  consider  the  vector  of  capacities  R=(R .  ,RR),  where 

_  * 

RA+RB=K.   There  exists  x>0.   K>0,  and   aeS     such  that   {st(so(h,R),cr)}cM(a) 

whenever  h<E,    | R.-Rg |  <x,  and  Q*(s  (h,R))=l. 

Proof:   See  the  appendix. 

The  proof  is  based  upon  the  observation  that,  when  the  periods  are  short  and  the 
firms  equally  placed,  private  incentives  to  conform  with  the  alternating  profile  nearly 
coincide  with  "social"  incentives  to  maximize  industry  revenues:  If  each  firm  has  kxR"/2h 

C  r)1, i 

units,  then  (4.4)  implies  that  neither  firm  deviates  from  a  if   e<(Tqrr)(l  +  ^         )•    On  the 

1         2k— 1 
other  hand,  coordination  is  required  to  maximize  industry  revenues  if  e<-Jl+6         ),  by 

(4.1).   But,  as  h-tO  and   <5->l,  these  condition  both  require  that  R~  <  -ln(2e-l)/r.   Thus, 

the  notions  of  "alternation"  and  "coordination"  are  intrinsically  linked  in  this  model. 
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D.   MPE  Extension  of  the  Alternating  Convention   a 

We  now  complete  the  program  initiated  in  Theorem  2  by  exhibiting  a  profile  which 
is  a  perfect  equilibrium  extension  of  a  to  the  entire  lattice.   The  extension  profile  takes 
the  following  general  form  on  S\  3(e):   The  larger  firm  supplies  one  unit  at  every  node, 
while  the  smaller  firm  supplies  nothing  if  and  only  if,  by  so  doing,  the  equilibrium  path 
reaches    3(e)  within  a  given  number  of  periods.   The  critical  number  of  periods  depends 
only  on  the  smaller  firm's  endowment.   The  argument  is  based  on  a  backward  induction  in 
two  dimensions,  as  suggested  oy  the  discussion  of  section  III.   We  require  the  following: 

Definition  4.3:  For  strictly  positive  seS,  and  0<e<l,  the  "waiting  time"   m(s)  is  given  by: 

(4.5)  m(s)  =  min{meI|V;c,  Kk<k(s):  <J)(n(s)+l-m,k)>e},  if  (j)(l,k(s))>e,  and 

m(s)  e  +cd,   if  <Kl,k(s))<e. 

Theorem  3:   With  any  arbitrary  sequence  {m,  :k>l}cl  associate  the  profile   ueX  which 
extends   a  as  follows:   On  3(e)-,   ct(s)eo-(s).   On  S\3(e),    cr.,.  \(s)=l;    a-,  \(s)=0  if 
l<m(s)<m,/\,  otherwise   a-,  \(s)=l.   For  each   e£(0,l)  there  exists  a  unique  sequence 
{m?:k>l}  such  that  the  associated  profile   a*,  determined  in  this  way,  is  an  MPE. 
Proof:   See  the  appendix. 

Comparing  (4.4)  and  (4.5),  in  light  of  Corollary  2,  one  may  see  that  m(s)  is  the 
time  that  the  smaller  firm  [i(sj]  would  have  to  wait  before  reaching  3(e)  if  it  does  not 
produce,  given  that  the  larger  firm  [j(s)]  supplies  constantly.   With  every  sequence  {m,  } 
we  have  associated  this  strategy  for  i   on   S\  .25(e):   "Wait  for  3(e)  when  holding  k  units 
if  and  only  if  you  can  get  there  within   m,    periods;  otherwise,  produce."    So   {mf}   may 
be  interpreted  as  the  parameters  of  an  "optimal  waiting  policy"  for  i,  given  the  assumed 
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behavior  of  j.   The  Theorem  asserts  that,  given  alternation  on  .25(f),  this  "optimal  wait- 
ing" by  i   together  with  constant  production  by  j  on  S\  .2* (e)  are  mutual  best  responses. 

This  waiting  strategy  extends  the  set  of  nodes  at  which  simultaneous  production  is 
avoided  (and  thus  collusion  is  achieved)  beyond  the  region  of  alternation,  -29(e),  by  giving 
precedence  (and  therefore,  enhanced  payoff)  to  the  firm  with  greater  capacity.   An  interest- 
ing interpretation  of  the  MPE   a*  is  the  t  the  larger  firm  exploits  its  position  so  as  to 
garner  a  greater  than  proportionate  shar^  of  the  collusive  profits  by  credibly  threatening  to 
"produce  all  the  time"  on  S\.25(e),  thereby  persuading  the  smaller  firm  that  it  is  in  its 
interest  to  forebear,  whenever  m(s)<mf  .<  \.   To  see  the  basis  of  this  outcome  consider  a 
node  s  such  that  k(s)=l,  and  n(s)=n>n'-l.  We  noted  above  that   a-, _\(s)=l,  WeE,  at 
such  a  node.   Thus  j  has  the  dominant  play  (or,  credible  threat)  of  "1"  at   s;   i  knows 
this,  and  will  wait  as  long  as  it  can  have  the  market  to  itself  within  n'  periods.   This 
reasoning,  applied  recursively,  generates  the  full  sequence  of  optimal  waiting  times,  {mf}. 

When  demand  is  inelastic  in  the  sense  of  (4.3)  these  waiting  times  can  be  explicitly 
found.   We  give  the  derivation  in  the  text  to  illustrate  the  conceptual  basis  of  Theorem  3. 


Corollary  3:  If  e<Tj-r  then  the  sequence  {mf },  k>l,  determining   a*  is  given  by: 


(4.6)  m*  =  Max{m6l|<5m-(T^)-(l+<52k~1)>f},  kel. 


Proof:  We  use  a  nested  induction  to  construct  i's  best  response  to  {^=1}  on  S\  25(e).  If 
seS\ .25(e)  and  k(s)=l,  then  it  is  obvious  that  i(s)  either  produces  at   s,  earning  /3e,  or 
waits  for  .25(e)  ,  earning  @6m^  '.  So  i   vi aits  if  and  only  if  m(s)<n'— lEm?,  since,  by  the 
proof  of  Corollary  2,  firm  j  produces  under   a  at  the  first  point  reached  in  .29(e). 

Inductively,  assume  that  following   a*  is  optimal  for  i(s)   whenever  k(s)<k— 1,  for 

{m?,  ...,m? 1 }  as  given  in  (4.6),  k>2,  and  consider  i's  best  play  from  seS\  .25(e)   where 

k(s)=k.   The  proof  rests  on  the  fact  that   m(s)<mf  implies  m(s— (l,l))<mf   ,    when  e<yq— i- 
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Corollary  2  implies  that  m(s)=n(s)— N,  for  Ne{n',n'— 1},  and  one  easily  verifies  from  (4.6) 
that  m£e{m£_, ,  m?_ ,— 1},  k>2,  so  m(s— (l,l))=m(s)<mf  <m?_ ..   Hence,  if  firm  i   plays 
"1"  at   s,  continuing  optimally  from  s-(l,l),  it  earns:   /3-[e+^s)+2.(l-<52k_2)/(l-<52)]. 
If  it  waits  for    ^(e)  it  earns:    6    ^'      -(1—8     )/(l—6).   Simple  algebra  shows  waiting  is 
better.   Therefore,  by  induction,    a*   is  optimal  from  all   s   such  that    l<m(s)<m£/  \. 

On  the  other  hand,  if  k(s)=k  and   m(s)=m?+l,  the  same  algebra  now  shows  i(s) 
does  better  to  play  "1"  at  s.   Suppose  this  is  true  when  k(s)=k  and  mf<m(s)<mf+m-l, 
m>2,  and  consider  the  best  play  for  i(s)   when  k(s)=k  and  m(s)=mf+m.   If  it  plays  "1" 
at   s  and  then  "0"  at  s— (1,1),   it  arrives  at  some  node  (say  s')   after  two  periods,  having 
earned  fie.  If,  alternatively,  i  plays  "0"  at  s   and  then  "1"  at  the  successor  node,  it  also 
arrives  at  s'    after  two  periods,  but  now  having  earned  only   6j3e  along  the  way.   Clearly 
the  former  course  dominates  the  latter.   The  induction  hypothesis  on   k  implies   i   will  do 
no  worse  than  the  former  course  if  it  plays  "1"  at   s,  while  the  induction  hypothesis  on  m 
implies  i   can  do  no  better  than  the  latter  course  if  it  plays  "0".   Therefore,  induction  on 
m,  for  fixed  k,  followed  by  induction  on  k,  establishes  that   cfi  is  optimal  for  i  at  any  s 
for  which  m(s)>mw  •>,  completing  the  proof.        □ 

Finally,  we  note  that  the  MPE  a*  obeys  a  kind  of  "inverse  entropy"  law:  The 
market  never  evolves  toward  less  coordinated  play  as  capacities  decline.   For  s£S\3(e), 
coordination  is  achieved  at  s  if  and  only  if  m(s)<mw  \,  in  which  case  the  ensuing  path 
leads  directly  to  .2>(f),  with  firm  i  forebearing  until  this  region  is  reached,  and  alternating 
play  thereafter.   If  m(s— t-(l,l))>m.w  n    ,,  0<t<k(s)—l,  then  both  firms  produce  until  i 
has  exhausted  its  capacity.  If  m(s)>mf/  >  but  m(s— t-(l,l))=mf/  \.,  l<t<k(s)— 1,  the 
equilibrium  path  begins  with  both  producing,  but  i   withdraws  at  the  node  s— t- (1,1)  and 
waits  for  &(e).   So   a*  satisfies:   seM(o*)  and  Q*(s)=l   only  if  {st(s,o*)}cM(o*).   That 
is,  if  coordination  is  achieved  at  any  node,  it  is  achieved  at  every  subsequent  node. 
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E.   The  Limits  to  Collusion  in  Perfect  Equilibrium 

We  complete  this  discussion  of  ta  ;it  collusion  by  showing  that  it  is  possible  to 
achieve  the  industry's  revenue  maximurr  in  an  MPE  from  a  considerably  larger  set  of 
nodes  than   M(o*)   when  demand  is  inelastic.   At  the  same  time,  our  constructive  methods 
show  why  collusion  is  not  possible  in  any  equilibrium  when  capacities  are  very  unequally 
distributed.   For  the  rest  of  this  section  we  assume  that   c^tt-t-   Denote  by  M*  the  set  of 
nodes  seS  from  which  there  exists  some  MPE   ctgE    satisfying:   {st(s,cr)}cM(a). 

Any  collusive  path  {s,(s, cr)}  occasionally  passes  between  the  firms'  "regions  of  pre- 
cedence", so  that  a  date  t  is  reached  wb en   cr(st)+a(st    ,)=(1,1).   From  Corollary  1  we 
know  that  V-(<7)(s,)<^/?(t— 4)   where  j  is  the  firm  for  which   <7.(s,)=0.   If  the  difference  in 
holdings  is  great  at   s   there  will  be  no  pt'th  which  maximizes  industry  revenue  and  keeps 
the  larger  firm's  payoff  below  this  bound.   But  this  means  that  the  larger  firm  would  not 
have  been  willing  to  forebear  in  order  to  arrive  at   s  in  any  revenue  maximizing  MPE.   So 
any  such  path  arriving  at  s  must  have  begun  with  the  smaller  firm  waiting.   There  is  a 
limit  on  how  long  a  firm  would  be  willing  to  wait,  which  may  be  determined  by  an  induc- 
tive argument  similar  to  that  used  in  the  proof  of  Theorem  3.   A  limit  is  ascertainable  in 
this  way  on  the  degree  of  disparity  in  initial  holdings  consistent  with  an  MPE  generating  a 
collusive  path.   We  can  then  use  Theorems  1  and  2,  and  the  notion  of  alternation,  to  con- 
struct an  MPE  generating  a  collusive  path  from  any  node  within  these  limits. 

Ignoring  Markov  strategies  momentarily,  let  us  study  those  time  paths  of  output 
which  avoid  simultaneous  production  everywhere  without  either  firm's  implied  payoff  be- 
coming "too  large".   Call  the  path  {q.:0<t<t}    admissible  from  s  if  (1)  q.e{(0,l),(l,0)}, 
Vt;  (2)  q  +q.+...+q    =  s;  and  (3)  the  value  of  the  sequence  {q    :t<t'<t},  for  either  firm 
and  any  intermediate  date  t,  0<t<t,  does  not  exceed  /?(t-4).    Consider  the  recursion: 
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(i)     n    =n',   and   n,  =n,     , +c,— 1,   kel,   where  for   k=l,  2,  ..., 

(4.7)  (ii)   ck  =  Max{cel|  /3(|=f )  +  ^-V^  <  /?({=£)},  and 

.        rC,  1+C,  ,        cW 

H  Vk  ~=  ^fe    )  +  S         -Vk-1>  with  Vo  ~=  #££  )• 
Then  we  have: 

Lemma  3:   An  admissible  path  from  s  exists  iff  n(s)<n,  /  %,  where  {n,},  kel,  solves  (4.7). 

Moreover,  for  kel,   c  <  ^-n^j+l  <  c+1,   where  cEMax{ceI|  f{l-S)/{l-f+1)>e}>l. 

Proof:   Consider  the  problem  of  finding  the  longest  possible  sequence  of  "l's"  and  "O's", 
given  that  exactly  k   "O's"   are  to  be  used  in  the  sequence,  such  that  no  "tail"  of  the 
sequence  ever  has  a  present  value  exceeding  /?(t— r),  where  a  "1"  earns  the  value  /?,  a  "0" 
earns  nothing,  and  the  discount  factor  is   6.   Let  n,  +k  be  the  maximal  number  of  "l's"  in 
this  sequence,  kel.   With  {h,  }  so  defined,  it  is  obvious  that  an  admissible  path  exists  from 
s  if  and  only  if  n(s)<n,  /  \.  The  recursion  (4.7)  is  the  Bellman  equation  for  this  optimiza- 
tion problem:   c,    is  the  number  of  "l's"  which  may  be  added  to  the  sequence  as  the  num- 
ber of  allowable  "O's"  increases  from  k— 1  to  k.   So,  c,  =(k+n, )— {k— 1+n,  _,),  which  is  (i). 
V,    is  the  present  value  of  a  sequence  at  its  maximal  length,  given  k.   So  (ii)  and  (iii) 
follow  from  the  Principle  of  Optimality  and  the  definition  of  present  value. 

From  (ii)  and  (iii):  /?(^^)<Vk</?(^),  kel.   Substitution  into  (ii)  yields: 

(4.8)  c  =  Max{cel|  <5c(l-<5)/(l-(5c+1)>e}  <  c,   <  Min{cel|  (5c(l-<5)/(l-<5c)<e}  =  c,  k>l. 


Since  tx^>£,  c>1;  while  the  definitions  above  imply  c<c+2,  and  hence  c<c,  <c+l.   More- 
over, if  f{l-5)/{l-f+1)>e>f+1(l-6)/{l-f+1),  then  c,  =c,  Vk>l.  d 


Define  N"={seS|k(s)=0  or  n(s)>n,  /  x— c,  /  \},  where  {n,  }  and  {c,  },  k>l,  are  as 
given  in  (4.7).   Notice  that  if  n(s)>hw  s— c,  /  \  then,  by  Lemma  3  and  Corollary  1,  no  col 
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lusive  MPE  path  is  possible  in  which  the  smaller  firm  produces  at  s.   Thus,  any  MPE  path 
from   serT  attaining  an  industry  revenue  maximum  involves  firm  i(s)   waiting  at  least 
n(s)— [h,  /  \-Ci  /  J   periods  before  producing.   Once  it  has  produced  on  a  collusive  MPE 
path  at  some  node  s'erT  for  which  n(5')=h,/>,—c,/>,,  the  ensuing  path,  necessarily 
admissible,  must  follow  the  pattern  of  production  implied  by  (4.7):   c,_.  units  supplied  by 
the  larger  firm,  followed  by  one  unit  from  the  smaller  firm,  for   l<t<k— 1.   Thus,  although 
admissibility  is  an  "open— loop"  concept,  it  provides  a  powerful  tool  for  assessing  the  limits 
to  collusion  in  MPE.   For  in  the  Appendix  we  prove  that  if  an  admissible  path  exists  from 
sgS,  or  if  i(s)  would  wait  from   s   to  reach  the  only  admissible  path  in  IT,  then  seM*. 

Theorem  4:   Given  any  sequence  {m,:k>l}cl  define  the  profile   a  on  rT  as  follows: 

(a)  If  n(s)=hk(s)^k(s)   then   *j(s)=0  and   3j(b)=1; 

(b)  If  k(s)=0,  or  mk(s)+hk(s)>n(s)>hk(s)^k(s),  then   <rj(s)=l   and   &.(s)=0; 

(c)  If  n(s)>nk^+mk^   then   a(s)=(l,l). 

*  

Then:  (i)  3  {mk:k>l}   for  which    o^Eivr;  moreover,  (ii)  {seS|n(s)<hk/  s+niw  JcM*. 

Proof:   See  the  Appendix. 

The  kinship  of  Theorems  3  and  4  should  be  apparent.  Theorem  3  constructs  the 
MPE   a*  from  the  restricted  equilibrium   a  on  ^(e)cS,  showing  that  constant  supply  by 
the  larger  firm,  and  optimal  waiting  by  the  smaller  one,  are  mutual  best  responses  on 
S\.2>(e).   Theorem  4  constructs  the  restricted  MPE   a  on  N  by  specifying  behavior  on 
the  "boundary"  of  rT  to  coincide  with  the  unique  admissible  path  in  that  set,  and  then 
showing  that  constant  supply  by  j  together  with  optimal  waiting  by  i   are  mutual  best 
responses,  given  this  specified  boundary  behavior.   Since   a  is  a  restricted  MPE  on  IT, 
there  exists  some   ae£     such  that   o=u  on  rT.   The  second  part  of  Theorem  4  is  based  on 
the  fact  that,  for  any  s^N,  we  can  choose  ~o  so  that  the  path   {s,(s,o}}  implies  alter- 
nating play  on  S\rT,  and  hence  (st(s,ff)}cM(ff).   We  conjecture,  but  have  as  yet  not  proven, 
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that   d  completely  characterizes  M*  in  the  sense  that  if  n(s)>n,  /  \+m,  ,  ^  then  s£M*. 

Theorem  4(ii)  states  that  the  existence  of  an  admissible  path  from  a  given  node  im- 
plies some  MPE  attains  an  industry  revenues  maximum  from  that  node.   So  Lemmas  2  and 
3  together  show  clearly  why  coordination  via  non-cooperative  play  is  more  difficult  to 
achieve  when  firms  precommit  to  paths  of  actions  than  when  they  use  state— contingent 
rules,  in  this  dynamic  duopoly  with  indivisible  production.   With  open-loop  strategies,  the 
possibility  of  coordination  turns  on  the  ability  to  find  a  path  for  which  the  discounted  value 
of  the  high  price  on  each  firm's  last  supply  date  is  no  less  than  the  discounted  value  of  the 
low  price  on  his  first  date  of  forebearance.   With  Markov  play  what  matters  is  whether  the 
right  to  produce  can  be  allocated  so  that  neither  firm  has  to  wait  when  the  value  of  contin- 
uing, on  a  prorated  per  period  basis,  exceeds  the  fall  in  price  induced  by  a  loss  of  coor- 
dination. Comparing  (4.2)  and  (4.7)  reveals  that  the  former  is  the  more  severe  constraint. 

Theorem  4  suggests  that  the  concepts  of  "alternation",  "forebearance",  and  "optim- 
al waiting"  are  sufficient  to  fully  describe  how  the  firms  might  tacitly  coordinate  their 
actions  in  this  setting.   As  the  proof  of  the  Theorem  reveals,  the  extreme  admissible  path 
implied  by  (4.7)  constitutes  a  "boundary"  separating  the  "interior"  of  rT  from  the  rest  of 
S.  From  any  seS\rT  the  firms  agree  to  alternate  until  reaching  this  boundary;  from  any 
serT  not  on  the  boundary,  the  smaller  firm  waits  for  the  boundary  if  it  is  in  its  interest  to 
do  so.   Play  along  the  boundary  gives  the  larger  firm  sole  access  to  the  market  roughly  the 
fraction   1— e  of  the  time  [see  (4.8)].   This  tacit  agreement  is  self— enforcing,  because  each 
firm  knows  it  to  be  in  the  other's  interest  to  conform  from  any  node  in  the  state  space. 

V.  Depicting  Discrete  Equilibrium  Dynamics  in  a  Continuous  State  Space 

Consider  a  continuous  formulation  of  this  game,  without  the  assumption  that  out- 

2 

put  rates  are  fixed  for  intervals  of  length  h.   The  state  space  is  all  capacity  pairs,  [R  ,  . 

o 
Each  firm's  (Markov)  strategy  maps  capacities  into  production  rates:   q-:R   -»{0,l},  i=A,B. 

The  profile  q=(q»,q-n)  partitions  the  state  space  into  regions  of  production,  q.  (1),  and 
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forebearance,  q-  (0),  for  each  firm,  and  induces  a  vector  field  determining  the  evolution  of 
the  system  over  time:   -r— =— q(R  ),  R    g-ven.    For   R=(R.,RR),  the  associated  phase  dia- 
gram shows  motion  "due  South"  on  qT  i  0)f1qR  (1),  "due  West"  on  qT  (l)nq^  (0),  and 
"Southwest"  on  q.    (l)fiqR  (!)•   Payoffs  from  any  node  are  given  in  the  obvious  way,  as 
the  discounted  integral  of  instantaneous  revenue  flows.   An  equilibrium  consists  of  a  stra- 
tegy pair  from  which  neither  firm  can  affect  a  unilateral  improvement  in  its  payoff  from 
any  node,  where  strategies  are  restricted  so  that  the  differential  equation  describing  the 
evolution  of  the  system  always  has  a  solution.    Any  such  restriction  will  imply  that  the  sets 
q.  (1)  have  a  relatively  simple  topological  structure. 

Now,  let  {a  :h>0}  be  a  family  of  MPE  of  the  discrete  game,  indexed  by  the  period 
length  h.   For  h  fixed,  a    will  cause  the  vector  of  capacities,  R  ,  to  evolve  over  time  in 
the  following  manner:   Given  R  ,  a  unique  initital  state  s  (h,R  )«h     -R     is  determined, 
as  is  the  path  {s.(s  (h,R  ),a  )}.   So, 

Rr+h-Rr  =  -h-ahK(So(1'Ro)'c7h))  =  -h-ah(So(h'Rr))'   Vr-h-t,  tel. 
By  analogy  with  the  continuous  case,   {o-l:h>0}   defines  a  vector  field  on  K      given  by: 

g  =  -^(Rr),-lim{ah(so(h,Rr))}, 
h-^0 

if  the  limit  exists.   The  implied  phase  diagram,  showing  the  motion  over  time  of  (R  :  r>0), 
would  mimic  closely  the  path  of  capacities  and  the  payoffs  generated  by   a  ,  for  "small"  h. 
The  MPE   cr*  defined  in  Theorem  3  treats  h  parametrically,  so  it  actually  defines 
a  family  of  MPE,  {cr*  :h>0},  as  envisioned  above.   But   a*     is  an  extension  of  a  ,  the 
alternating  profile,  and  {a  (s  (h,R))}  has  no  limit,  as  h-»0:   Alternation  leads  to  "chat- 
tering" for  infinitesmal  h.   The  continuous  approach  outlined  above  could  not  produce 
behavior  in  equilibrium  like  that  which  emerges  in  the  equilibria  we  have  studied  for  the 
discrete  game,  with  h   "small".  This  is  not  necessarily  a  criticism  of  the  discrete  approach; 
one  might  as  readily  conclude  that  the  continuous  formulation  is  flawed,  to  the  extent  that 
it  fails  to  capture  the  strategic  possibilities  inherent  in  a  convention  of  "turn— taking". 
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Moreover,  this  "chattering"  in  the  limit  need  not  prevent  a  phase  diagrammatic  depiction 
of  equilibrium  dynamics;  we  can  simply  define:  (j,^)  e  lim{a  (s  (h,R))},  as  h->0,  VReR  ,  ,  . 

With  this  definition  in  mind,  note  that  the  equilibrium   a*   defined  in  Theorem  3 
partitions  the  state  space  S  into  three  mutually  exclusive  and  collectively  exhaustive 
regions:  a  region  of  alternation,  3>;  a  region  of  forebearance  by  the  smaller  firm,  3f;  and  a 
region  of  simultaneous  production,  <tf.   Drop  the  explicit  dependence  of  3)  on   e.   Clearly, 
Jr={seS|0<m(s)<m?/  >.}  and  o^fseS  |  m(s)>mw  >.}.   The  dynamics  generated  by   a*,  for 
"small"   h,  satisfy:  37=-^),  if  sQ(h,R)G  3  as  h-»0;  ^=-(1,1),  if  sQ(h,R)6  &  as  h-»0; 

and,  ^=-(1,0)  [-(0,1)],  if  s  (h.RJe,?  as  h->0,  for  j(s0(h,R))=A  [B].   The  results  in 

2 

Section  IV  (and  their  proofs)  can  be  used  to  solve  for  the  regions  of  K  ,    in  which  these 

various  motions  obtain,  even  when  explicit  formulae  cannot  be  found  in  the  discrete  case. 

Abusing  notation,  write  Re  3  [«?,  &]  if  3E>0  such  that,  Vh<E,  sQ(h,R)6^  \&,  &\. 

Denote  xe|R.— Rg|,  yEMin{R.,RR}  and  zEMax{R.,RR};  then  h-+0=>h-n(s  (h,R))-»x  and 

h-k(s  (h,R))-»y.   Define   w(x,y)=lim{h-m(s  (h,R))},  and  w*(y)  =  lim  {h-m*,     (\.j>\\}- 
0  h^O  °  h-.0  KlVn'tt" 


Proposition  2:  The  limiting  dynamics  implied  by  the  MPE  (family)    a*  on  the  continuous 

2 
state  space  K     =  (3U3rU<^')  are  as  described  above,  where: 

Re3  if  e~rx-e_2ry/  +  ^(l^T2ry')  >  e,Vye[0,y]; 

RG^  if  0  <  w(x,y)  <  w*(y);  and,   Reo^if  w(x,y)  >  w*(y). 

Moreover,   w(x,y)  =  x  +  -ln(e),  e<^;  w(x.y)  =  x  +  j-ln^  +  (e_2~)e  ^  £>2'   and: 

(5.1)  w*(y)=i.ln[(l+e-2ry)/26],   e<£; 

(5.2)  w*(y)  =  Max{0,  l.ln[s(z)^(z)l£^]}|    e>l, 
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for   u(y)=[^=^-^— ]>0<y<y=-ln(2f.-l)/2r.   When   e>l,   w*(y)=0,  y<y<y,  for  ye(0,y). 
4+(2e-l)e    yJ  A 


Proof:   The  argument  is  a  straightforward  consequence  of  taking  limits  in  (4.4),  (4.5)  and 
(4.6),  except  for  the  derivation  of  (5.2),  which  is  in  the  Appendix.   By  Corollary  2,  Re  3>  if 
3K>0  such  that,  Vh<K:  cj>(n(s  (h,R))+L,k)>e,  Vkel,  k<k(s  (h,R)).   Then  (4.4)  implies: 

<j)[n(so(h,R))+l,k(so(h,R))]^e"rx-e~2ry  +  ^(l^e~2ry),  as  h  -  0. 
Moreover,  from  (4.5)  one  sees  that   m(s  (h,R))  must  "solve":  <Kn— m+l,k)«e.   Equating 
the  limit  above  with    e,  substituting   (x— w)   for  x,  and  solving  for   w,  gives   w(x,y).   Let- 
ting h^O  in  (4.6)  gives:    e  s  ^    *  )(l-^k_1)  i  2-e-rw*-(l+e_2ry),  as  h-»0,  which  is 
(5.1).   The  derivation  of  (5.2)  is  more  co-nplex  since  no  explicit  representation  of  {mf }  is 
available  when   OrriS  equation  (A. 2)  in  the  proof  of  Theorem  3  must  be  utilized.  □ 


It  follows  from  (5.1)  that,  when  c<^,  a  coordination  failure  occurs  under  the  MPE 

1      re-ry+e~ryl  ,     N 

a*  for  "small"   h  if  z  —  x+y  >  --In  - — ^ .   The  comparable  limit  in  (4.2)  implies 

r       L    It         J 

1       fe-2ryl 
that  a  coordination  failure  must  occur  in  any  open— loop  equilibrium  if  z  >  --In  *—  . 

•-     e    -• 
This  gives  a  clear  measure  of  the  superiority  of  closed-loop  play  for  achieving  coordination. 

A  comparable  measure  can  be  derived  from  (5.2),  which  shows  that  the  advantage  of  state- 
contingent  play  vanishes  as   e-»l.   Observe  that  the  expressions  in  (5.1)  and  (5.2)  converge 
as  e-»l/2,  so  equilibrium  behavior  is  continuous  in  the  elasticity  of  demand.   It  is  easy  to 
compute  firms'  payoffs  for  various  endowment  configurations  using  the  results  of  Proposi- 
tion 2,  and  so  to  verify  our  assertion  that  the  larger  firm  garners  a  greater  than  propor- 
tionate share  of  industry  earnings  from  nodes   R£&  where  w(x,y)»w*(y).   Figures  4A  and 
4B  depict  the  limit  phase  diagrams  for  ihe  MPE   a*  in  the  respective  cases    e<jr  and  e>^. 
The  power  of  this  limit  argument  is  made  clear  by  its  application  to  the  restricted 
MPE   a  defined  in  Theorem  4.   For  there  the  discrete  game  is  very  difficult  to  analyze, 
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but  its  continuous  analogue  is  transparent.   Lemma  3,  and  equations  (4.7)  and  (4.8),  can  be 
employed  to  see  that   s  (h,R)erT,  for  h  sufficiently  small,  if  2>-ln(e)/r  +  (— )-y.   The 
extreme  admissible  path  defining  the  boundary  of  rT  "chatters"  with  infinitesmal   h,  since 
(by  (4,8))  c,e{c,  c+1},  where  c=Max{et-:r|c< — -}.   But  as  h-»0  this  boundary  becomes  a 
line  with  slope  (relative  to  the  larger  firm's  axis)  -r^-,  along  which  the  larger  firm  produces 
on  average  the  fraction  1— e  of  the  time.   (See  Figure  4C.)   The  proof  of  Theorem  4  shows 
that  one  can  extend   a  in  such  a  way  that,  "above"  this  line,  the  firms  agree  to  take 
alternate  turns.   The  profile   a  requires  that  "below"  this  line  the  larger  firms  produces 
constantly  and  the  smaller  firm  waits  optimally.   Optimal  waiting  is  equivalent  to  the  effi- 
cient use  of  an  option  to  go  to  the  boundary  of  rT,  taking  the  payoff  implied  by   a  there. 

Let  us  see  how  the  limit  of  the  waiting  times,  w(y)El  im{h-m,  /     ,,  -q\\}>  an<^  hence 

the  behavior  implied  by   a,  may  be  derived.   From  equation  (A. 3)  in  the  Appendix  one  can 
verify  that  the  problem  facing  the  smaller  firm  at  any  node  s  for  which  n(s)>h,  ,  \  is  to 
decide  how  long  to  earn  @e  by  producing  each  period,  before  waiting  to  move  to  the  ex- 
treme admissible  path  where  the  firm  can  earn  j3  each  time  it  produces,  but  only  gets  to 
produce  the  fraction   e  of  the  time.   (Because  of  discounting,  earning  /?  for  duration   e, 
and  nothing  for  duration   1— e  is  better  than  earning  @e  continuously,  so  waiting  might 

pay.)   For  each  unit  of  time  that  the  firm  produces,  the  wait  increases  by  roughly  

units  of  time,  since  the  boundary  of  N  has  slope  —  while  the  path  along  which  both 
firms  produce  has  slope  1.   Passing  to  the  limit,  if  the  smaller  firm  has  y  units  of  stock,  if 
the  wait  for  the  boundary  of  rT  is  initially  w,  and  if  the  firm  produces  for  t   units  of  time 
before  beginning  to  wait,  then  its  earnings  are: 

(5.3)  n(y,t,w)  =  |.{[l-^rrt]  +  ^-^[e-rt/'W-^/6]},   y>0,  w>0,  and  0<t<y. 
Accordingly: 

(5.4)  w(y)  =  Max{w>0|  n(y,0,w)  >  Max  II(y,t,w)}. 

0<t<y 
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One  may  verify  from  (5.3)  that   d"U/dwdt>0,  so  that   n(y,0,w)>fl(y,t,w),  Vte[0,y], 
if  and  only  if  w<w(y).   Notice  as  well  that    II  is  not  generally  a  concave  function  of  t. 
This  fact  has  the  implication  that  the  equilibrium  path  under   a  may  not  be  continuous  in 
the  initial  conditions,  since  the  argmax  in  (5.4)  need  not  be  a  continuous  function  of  w  and 
y.   [E.g,  this  continuity  is  crucial  to  the  derivation  of  (5.2).]   Thus,  it  may  be  desirable  for 
the  smaller  firm  to  wait  to  reach  the  boundary  of  N"  from  some  node,  and  yet  a  slight  de- 
crease in  endowment  would  cause  its  strategy  to  switch  to  producing  for  a  protracted  per- 
iod.  A  number  of  interesting  results  along  these  lines  can  be  derived  with  remarkable  ease 
from  (5.3)  and  (5.4),  as  exemplified  by  the  following: 


In 


1-e 


-ry/<h 


.    Some  MPE  extension  of  a 


Proposition3:    Assume  e<l/3.   Then  w(y)E--__ 

r       L  1-e  ry 

attains  the  high  price  continuously  if  z  <  — ln(e)/r  +  (1— e)y/e  +  w(y),  for   h   sufficiently 
small.   If  the  inequality  is  reversed,  then  xhe  low  price  obtains  at  every  date  until  the 
smaller  firm  has  exhausted,  under  any  MPE  extension  of  a,  when  h  is  small  enough. 

Proof:  Using  subscripts  to  denote  partial  derivatives,  (5.3)  implies: 

r-1nt(y,tlw)  =  e-rt-er(t-w).[(^)e-rt/6  +  e-ry/e];  while, 

r"2n    (y,t,w)  =  ^Trt  +  er(t-w)-[(^)2e-rt/e-e-ry/£]. 
tt  e 

Thus:   [r-1nt+r~2ntt]  =  er(t_w)-[(^)(-1-=2^)e~rt/e  -2e~ry/e]  >  0,  0<t<y,  Vw>0,  since 
e<l/3.  So  n, <0  3  n,,>0,  and   n,,<0  3  II  >0.   In  particular,  the  function   n(y,t,w)  never 
attains  an  interior  maximum  with  respect  to  t  on  the  interval  [0,y].  Thus,  since  IT.    >0, 
for  any  y>0  there  exists  w(y)   such  that   n(y,0,w)>n(y,y,w)  iff  y<w.        □ 

VI.   Conclusion 

In  the  model  of  dynamic  duopoly  studied  here  firms  regulate  an  "on-off '  production 
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process,  depleting  their  respective  supplies  of  an  non— replenishable  factor  of  production  at 
a  constant  rate  whenever  their  process  hi  "on".   When  both  are  producing  the  price  is  low. 
If  only  one  of  them  produces  an  any  moment  the  price  is  high.   Each  of  them  knows,  at  any 
moment,  the  remaining  capacity  to  produce  of  the  other.   Two  notions  of  equilibrium  are 
investigated.   In  open— loop  Nash  Equilibrium  the  firms  commit  at  the  initial  date  to  time 
paths  of  production  which  are  mutual  best  responses.   In  Markov  Perfect  Equilibrium,  the 
firms  specify  state— contingent  plans  of  actions  which  are  mutual  best  responses  in  every 
contingency.   The  capacity  of  the  firms,  in  these  alternative  strategic  settings,  to  achieve 
coordination  of  their  actions  via  non-cooperative  play  has  been  systematically  investigated. 

By  assuming  firms  can  change  their  actions  only  on  exogenously  given,  equally 
spaced  moments  in  time,  we  convert  the  problem  to  a  discrete  time  game  which,  due  to  the 
dichotomous  production  structure,  can  be  explicitly  solved.   The  possibility  for  coordin- 
ation to  be  attained  via  non-cooperative  play,  in  either  strategic  setting,  is  characterized 
(in  Lemmas  2  and  3)  by  two  relatively  simple  integer  programming  problems.   These 
characterizations  allow  us  to  derive  the  main  comparative  results  of  the  paper:   that  the 
ability  to  precommit  undermines  the  possibility  of  coordination;  and  that  state-contingent 
strategies  are  more  effective  at  coordinating  play  when  firms  are  symmetrically  placed. 

The  analysis  emphasizes  a  difference  between  discrete  and  continuous  time  dynamic 
games  recently  stressed  by  Simon  and  Stinchcombe  (1989).   The  extensive  form  analyzed  in 
the  discrete  game,  though  restricted  to  Markov  strategies,  reveals  a  rich  structure  largely 
hidden  from  view  in  a  continuous  time  treatment.   The  idea  of  "precedence"  —  firms'  mak- 
ing their  play  contingent  on  "who  did  what  last  period"  —  is  fundamental  to  achieving  max- 
imal coordination  in  this  environment.   Of  course,  this  finding  is  related  to  the  "on— off ' 
process  of  production  which  has  been  assumed  here,  but  the  point  would  seem  to  be  of  more 
general  import.   The  results  in  section  V  on  the  limit  behavior  of  our  Markov  Equilibria 
suggest  the  possibility  of  profitably  using  this  discrete  approach  in  other  settings  where 
continuous  time  formulations  have  been  reflexively  adopted. 
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Appendix 


Proof  of  Theorem  1: 

k  k 

Define  S  ={(a,b)eS|a+b<k},  kel.   The  sets   {S  }   are  nested,  cover  S,  and  are 

closed  to  deviations  from  any  profile   ugE.   Therefore  E,   ,     -, cS •   ,,  k>l,  and  hence 

E  =f    |{^i    v;k>l}-   Now,  clearly,  S.    ^E.   For  fixed  k>l,  for  any   ctgE,    ,,  and  for  every 
I     |      |gK  |gi  ISK 

k+1     k 
se(S        \S  ),  one  of  the  alternatives  (3. 2)(a)-{c)  holds  at  s.   Moreover,  for  such  k  and  s, 

the  continuation  payoffs  in  (3.2)(a)— (c)  do  not  depend  on  values  taken  by   a  on  S\S  . 

Therefore,  given  WcS  and   aGS,w,  the  following  inductive  procedure  constructing 

the  profile   <7*eE  is  well  defined.   For  successive  values  of  k=l,  2,  ...:   (1)  At  each  node 

se(Sk+1\Sk)n\V,  set   o*(s)=o(s).   (2)  At  each  seSk+1\(SkUW),  assign  a  value  to   o*  con- 

*  * 

sistent  with  the  Bellman  Inequalities  there.   Now   <7eEiw  implies   aeE,    ,        ,  k>l.   Thus 

|W  lsKnW 

*  * 

Lemma  1  implies  that    a*eE,   ,  ,  at  each  step  k.   Therefore   u*e£  .         q 

Proof  of  Corollary  2: 

Let   seS  be  strictly  positive,  with  n=n(s),  k=k(s),  i=i(s)  and  j=j(s).   By  the 
definition  of  the  alternating  profile  we  have  that: 

(i)   V.(a)(s)  =  ^.[(l-^2k-2)/(l-«2)  +  82k-2-(l-8n+1) /(1-5)},  it  a-.(8)=0;  and, 

(ii)  V.(a)(s)  =  8P(l-62k)/(l-P),  if  ai(s)=0. 
Now,  by  Definition  4.1,   se  .25(e)  if,  for  every  s'  a  strictly  positive  successor  of  s   under 
a,  V.(a-)(s')<£(jEj)  when   cr.(s ' }=0,  and  V.(o-)(s')<^3)  when   3j(-s'j=0.   Notice  that 
the  RHS  of  (i)  exceeds  the  RHS  of  (ii),  for  any  n>0,  k>l.   Therefore,  when   a-(s)=0  then 
se^(e)  if:   (1)  [RHS  (i)]<$(yEj)  for  Kk<k(s)  and  n=n(s);  and,  (2)  [RHS  (ii)]<#?(^)  for 
l<k<k(s)-l.   When  a.(s)=0  then  se3{e)  if:  (1)  [RHS  (i)]<#3(^j),  for  l<k<k(s)  and 
n=n(s)-l;  and,  (2)  [RHS  (ii)]<£,5(t4)  for  l<k<k(s).   Condition  (2)  is  redundant  in  each 
case  if  n>l.   Given  (4.4),  condition  (1)  is  equivalent  to:  (J)(n(s),k)>e,  l<k<k(s),  if  cr.(s)=0: 

and,  <J>(n(s)+l,k)>e,  l<k<k(s),  if  cr.(s)=0.   But  if  (4.3)  holds  then  (j)(n,k)>e,  V  k>l,  if  and 
only  if  n<n' .   This  completes  the  proof.         □ 

Proof  of  Proposition  1: 

Suppose   e<l/2.   Let  R=(R.,Rr>)  be  some  vector  of  initial  stocks  such  that 
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RA+RB=R.    Given  any   h>0,  Corollary  2  implies  so(h,R)e  3(i),  VR">0,  if: 

(i)     |RA-RB|  < -ln(e)/r,  and 

(ii)  h  <  ln(^)/r. 

Condition  (i)  assures  that   n(s  (h))<n'(h)— 1  (neglecting  rounding  error,  which  vanishes  for 
small  h),  while  condition  (ii)  gaurantees  :hat  (4.3)  holds.   Theorems  1  and  2  together 
imply  there  exists  an  MPE  profile  a  coinciding  with   a  on  -2>(e).   Since  s  £^(f),  from 

the  definition  of  a,  we  conclude  that   {s.^s  ,a)}cM(a). 

Suppose   e>l/2,  and  continue  to  denote  by   a  an  MPE  satisfying   a=a  on  3){t). 
Let  R=(RA,RB),  with  RA+RB=R~,  and  define  |Ra-Rb|ex,  Min{RA,RB}Ey.   Fix  h>0 

and  let  s=s  (h,R).   It  follows  from  C4. 4)  that   sG^(e)   when   ir-,  •  (s)=0  if  and  only  if: 

o  }{t>) 

(A.l)  5(h)2k(sH  >      e-\l+5(h)]-6{h) 

-^h)^s^[l  +  ^h)]-l 

Take  the  limit  in  (A.l)  as  h-+0,  using  the  definitions  of  s,  n(-),  and  k(-),  to  see  that 
s(h,R)e  .25(c)  for  h  sufficiently  small  if: 

(iii)    e~2ry  >  (2e-l)/(2e~rx-l). 
As  long  as: 

(iv)  R  <  -ln(2e-l)/r, 

it  will  be  possible  to  select  x>0  and  y>0  such  that  x+2y=R~,  and  (iii)  holds.   Now,  by 

the  analysis  leading  to  (4.1)  we  know  that   Q*(s  )=1  iff:   n(s)>n(s)+2k(s)sR~.   But  (4.1) 

implies  h-n(s  (h,R))  -»  — ln(2e— l)/r,  as  h-»0.   Thus  (iv)  is  satisfied  whenever  R  is  below 

the  threshold  at  which  collusion  becomes  desirable.   For  such  R"  it  is  always  possible  to 
divide  the  total  endowment  such  that  (iii)  holds.   Thus,  for  h  is  small  enough  we  can  find 

R=(RA,RB),  RA+RB=R~,  such  that  {st(sQ(h,R),a)}cM(a)  whenever  Q*(sQ(h,R))=l.      □ 

Proof  of  Theorem  3: 

We  have  to  show  that  the  profile  t*  is  well  defined,  and  that  neither  firm  will 
deviate  from  it  at  any  node  seS\  3){i).  This  amounts  to  showing  two  things:  (1)  that 
there  exists  a  unique  sequence  of  integers  {m?}  such  that  on  S\J?(e)  the  strategy  "wait  for 

3)(e)  from  s  if  you  can  get  there  within  mw  \  periods"  is  a  best  response  for  i(s)   to  the 

strategy  "always  produce"   for  j(s);  and,  (2)  the  strategy  "always  produce"  is  a  best 
response  for  j(s)  to  the  strategy  given  in  (1)  for  i(s).   The  proof  of  Corollary  3  establishes 
(1)  in  the  case   e<8/(l+6),  when  an  explicit  formula  for  {mf }  can  be  found.   The  nested 

induction  employed  there  may  be  extended  to  the  general  case   e€(0,l)   as  follows. 

Consider  S(k)={seS\i5(e)|k(s)<k},  k>l.   Given  any  integers  {m,,  ...,  m,  },  the 
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profile  defined  in  the  Theorem,  and  thus  each  firm's  value  function,  is  uniquely  determined 
at  every  node  in  S(k).   Therefore,  the  following  induction  uniquely  defines  a  sequence  of 
integers   {mf},  k>l,   and  so,  by  the  definition  in  the  Theorem,  a  profile   o*: 

(A. 2)      m*  =  n'-l;   and  for  k>2, 

2k 
m*  e  Max  {m(s)}   s.t.   k(s)=k,  and  £m(s)+1-[— o  ]  >  0e+8V.,Jo*)(s-(l,l)), 
k     seS  i-r  1[s) 

where  mt =0  if  the  inequality  always  fails.   This  definition  is  meaningful  because  at  each 
k>2   the  profile   a*   and  values  V-(cr*)(-),  i= A, B,  are  already  defined  on   S(k— l)U^(e); 
and,  k(s)=k  implies  s— (l,l)eS(k— l)U^(e).   The  integers  mf   are  defined  by  the  require- 
ment that  waiting  to  reach  an  alternating  path  is  dominated  for  i(s)   by  producing  when- 
ever m(s)>mf .   What  we  have  to  show  is  that  waiting  for  one  period  at   s  is  optimal  for 

i(s)   if  and  only  if  m(s)<mf ,  Vs6S\^(e).   For  this  we  use  the  following  nested  induction. 

Suppose:  (a)  i(s)  will  not  deviate  from   a*  on  S(k— 1),  k>2;  and,  (b)  the  inequality 
in  (A. 2)  fails  at   s,  where  k(s)=k  and  m(s)=m>l.   Then  that  inequality  also  fails  at   s', 
where  k(s')=k  and  m(s')=m+l.   By  (b),  waiting  to  reach  -2>(e)  from  s'   is  worse  for  i(s) 
than  waiting  for  one  period,  producing  for  one  period,  and  then  following   o* .   While,  by 
(a),  producing  for  one  period  and  then  following   a*  is  at  least  as  good  as  producing  for 
one  period,  waiting  for  one  period,  and  then  following   a*.   Thus,  producing  and  then  fol- 
lowing  o*   dominates  waiting  to  reach  i?(e)  from  s'.   So  either,  for  all  seS(k)  with 
k(s)=k,  producing  for  one  period  and  then  following   a*  is  better  than  waiting  to  reach 
@(e),  whence  rn^EO;  or,  for  some  m?>0,  the  inequality  in  (A. 2)  holds  iff  m(s)<mf .   For 

fixed  k,  this  induction  on   m  implies   i(s)   follows    a*   on   S(k)   if  it  does  so  on   S(k— 1), 
k>2.   The  induction  on  k,  and  the  fact  that  i(s)  follows   a*  on  S(l),  then  imply  that  the 
"optimal  waiting  strategy"  defined  in  (A. 2)  is  firm  i's  best  response  on  all  of  S. 

We  now  show  that  the  larger  firm  will  not  deviate  from  constant  production  on 
S(k),  k>l,  given  that  the  smaller  firm  follows   a*.   We  do  so  by  induction  on  k.   Clearly 
j(s)  will  not  deviate  from   o*   at  any  node  where  i(s)  is  waiting,  so  we  need  only  consider 
s  tor  which  m(s)>mw  y  We  argued  in  the  text  that  j(s)  follows   a*  on  S(l).   Assume 

this  is  so  on  S(k— 1),  k>2.   Take  any  seS(k)   such  that  k(s)=k  and  m(s)>mf.   Let   s'  be 

the  node  reached  if  j(s)  follows   a*,  let  s "  be  the  node  reached  if  j(s)   deviates  from   a*, 
and  let  m=m(s').   There  are  three  cases:   (1)  m>mf_, ;  (2)  m<m?;_, ;  and  (3)  m=m?_, . 

Case  (1):   By  the  definitions,  m(s' ')=m+l.   So  in  this  case  i(s)   will  be  playing  "1" 
at  the  successor  to  node  s,  whether  j(s)  follows   a*  or  not.   But  then  j(s)   cannot  gain 
by  deviating.   For,  given  discounting,  firm  j  does  better  to  follow   a*   at   s  and  then 
deviate  at   s',  than  to  deviate  at  s   and  then  follow   o*  at   s",  since  both  paths  termin- 
ate after  two  periods  at  the  same  node.  Yet,  by  the  induction  hypothesis,  the  payoff  from 
the  former  course  is  a  lower  bound  on  the  value  of  following   a*   at   s,  and  the  payoff  to  the 
latter  course  is  the  best  that  can  happen  if  firm  j  deviates  at   s. 

Case  (2):  In  this  case  i(s)  will  be  waiting  to  reach  @{e)  from  the  successor  to  s, 
whether  j(s)  follows  o*  or  not.  So  j(s)  does  better  under  a*  from  s"  than  it  would 
do  under  the  pure  alternating  profile.   Moreover,  since  s£3(e),  it  follows  that: 

Vj(s)(0(s"')>Vj(5)(a)(s"')>«±3- 
So,  were  j(s)   to  play  "0"  at   s,  its  value  there  would  exceed   6(3(1— e)/(l— S).  But  then  the 
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reasoning  of  Theorem  2  and  Corollary  1  implies  firm  j   will  not  wait  to  "take  its  turn" 
under  such  circumstances.   Hence,  j  follows   a*   at   s. 

Case  (3):   In  this  case  if  j(s)  follows   a*  it  goes  to  the  node  s'  where  i(s)   begins 
waiting  to  reach  -S>(c),  while  by  deviatirg  it  goes  to  s"  where  i(s)  is  not  waiting.   We 
know  from  Case  (2)  that  if  i(s)  were  to  begin  waiting  at  s"  then  it  would  not  pay  for 
firm  j  to  deviate  at   s.   Thus,  we  can  dispose  of  this  case  if  we  can  show  that  firm  j  is 
worse  off  at   s"  when  firm  i  produces  there  than  when  firm  i  begins  waiting  there.   Let 
t'    denote  the  first  date  at  which  firm  i(s)   waits  along  the  path  {s,(s'  ',o*)},  0<t'<k— 1. 

If  t'=0,  we  are  in  Case  (2).   A  critical  fact  for  this  proof  is  the  following: 

m(st/(s",a*))  <  m(s"),   l<t'<k-l. 

The  waiting  time  to  reach  3){e)   does  not  rise  as  one  moves  along  the  path  {st(s'  ',&*)}■ 

To  see  this  notice  that,  by  Corollary  2,  m(s)=Min{meI|Vk,  l<k<k(s),  <}>(n(sV-m+l,k)>e}. 
The  discussion  following  Corollary  2  in  the  text  implies  that,  as  k(s)  falls,  the  critical  value 
of  n  for  which  4>(n,k)>e,  l<k<k(s),  does  not  decline.   But   n(s,,(s",<7*))=n(s"),  and 

k(st,(s",cr*))<k(s")=k-l,  by  the  definitions.   Thus,  m(st,(s",a*))<m(s")- 

With  this  observation  in  hand,  we  complete  the  proof  by  showing  that  firm  j  is 
worse  off  under   a*   at   s"  if  firm  i  produces  there,  than  if  it  does  not.   Given  that  firm  i 
is  about  to  begin  waiting  for  .0(e),  it  is  obviously  better  for  firm  j  if  the  wait  is  longer. 
Define  V*(t)=V./\(oJ|c)(s"),  given  that  t'=t;let  s=s.(s",a),  and  m=m(s,).  We  need 

to  show  that  V*(0)>V*(t),  l<t<k— 1.   Frcm  the  definition  of  the  profile   o*  we  have: 

,    rt  ,  ,    r        t  1+t+m, 

V*(t)  =  /?e(i=|  )  +  fitffi  )  +  S  -Vj(s)(a)(st),   0<t<k-l, 

where   s,E3(e)   is  the  first  node  reached  at  which  firm  j  waits.   Since  a  longer  wait  by  i 
favors  j,  and  since  m,    ,<m,,  we  know  that  V*(t+1)  is  no  greater  than  the  value  implied 

above  when  m,    ,=mt  is  assumed.   Biu  m,    .=m.   implies   §t_,  1=s. — (1,1).  Therefore: 

t  f+1  1+mt  1+mt 

v*(t)-v*(t+i)  >  pe{(^  H^j-  )}  +  p{s\^l      Ht+1({^      )) 

1+t+m 
+  &  {Vj(s)(&)(st)-^Vj(s)(5^)(st-(l,l))}J   0<t<k-l. 

Now  because   s,6^(e),  and  j  is  waiting  at   s\,  we  have:  V.(a)(s.)>0t+6V.(cr)(s.— (1,1))- 
t  i  j         t  j       v  t 

Substituting  this  inequality  into  the  expression  above  yields  the  following: 

1+m 
V*(t)-V*(t+1)  >  p5l{l-e)(l-6        l)  >  0,   0<t<k-l. 

So  V*(0)>V*(t),  l<t<k-l.   We  conclude  that  j  will  not  deviate  from   a*  in  Case  (3).       □ 

Proof  of  Theorem  4: 

(i)    We  begin  by  noting  that  N"  is  closed  to  deviation  from  a,  in  view  of  the  fact  that 
n,  >ri,  _-,  k>l.   Thus  we  must  show  that  neither  firm  can  gain  by  deviating  at  any  node 
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seN".   Neither  will  deviate  when  it  alone  produces.   The  proof  of  Theorem  2  implies  j(s) 
will  not  deviate  from  waiting  at  nodes   seN  for  which   n(s)=n,  /  \— c,  /  \,  since  its  payoff  at 

the  successor  node  under   a  does  not  exceed   <5/?(t— r).   So  what  must  be  shown  is  that,  for 

some  {m,  },  i(s)   will  not  deviate  when   a- /  n(s)=0,  and  neither  firm  will  deviate  when 

o-(s)=(l,l).   The  structure  of  the  argument  is  similar  to  that  used  for  Theorem  3. 

Consider  first  firm  i(s)    We  assert  the  existence  of  some  sequence  of  integers 

{mk}cl,  yet  to  be  determined,  nich  that   "a.,  x(s)=0  iff  nk(s)~ck(s)<n^s^-iik(s)+™k(s)" 

is  a  best  response  to  "a-/  \(s)=l   iff  h,/ „\-Cw  >><n(s),"   seN.   Let  N"  E{seN|k(s)<k},  kel. 

A  nested  induction  of  the  sort  used  in  the  proof  of  Theorem  3  establishes  that  the  optimal 

k      k— 1 
behavior  of  firm  i   must  be  of  the  "optimal  waiting"  form:  I.e.,  if  for  some  seN  \N 

with  h,  — c,<n(s)  it  is  optimal  for  i  to  produce,  then  it  is  also  optimal  for  i  to  produce 

k      k— 1 
at  s'eN  \N         for  which  n(s')>n(s).   The  argument  turns  on  the  fact  that,  due  to  dis- 
counting, it  could  never  pay  i  to  wait  in  order  to  reach  a  node  at  which  i  will  produce, 
when  it  could  produce  and  reach  a  node  from  which  it  might  choose  to  wait.   So  we  need  to 

show  that  the  length  of  i's  optimal  wait  on  N"  \N  —    is  at  least  c, .  This  is  obviously 

true  for  k=l,  since  i   would  wait  a  total  of  n'  periods  to  play  its  one  unit,  so  (consulting 

(4.7))   m^n'— c.>0.   Suppose,  inductively,  that   {in,,  ...,  m,    ,}cl,  k>2,  are  such  that   a 

gives  i's  best  play  on  N       ,  and  consider  his  optimal  behavior  at  seN  \N 

Now  waiting  by  i   takes  it  to  a  node  s/    for  which  n(s/)=n\  — c,  ,  after  which   c 

causes  the  following  pattern  of  play  to  ensue:  i  produces  for  one  period,  j  produces  for 
clc_i    periods,  i  produces  for  one  period,  j  produces  for  c,    2  periods,  etc.  We  know 

that  v;(^)(sic)=<^vk-l'  for  Vk-1  given  in  (4J)-  Let  TkE2k+nk-V=n/+k+(c1+---+ck--l) 
be  the  sum  of  firms' capacities  at   s/.   Therefore  firm  i's  payoff  at  s/    is  given  by: 

(i)  wk  e  VjC&k^o  =  p(i-s  k)/(i-6)  -  svk_r 

Assuming,  inductively,  that  V(a)(s)  is  well  defined  on  N~  —  ,  we  conclude  that  waiting 

k      k— 1 
will  be  optimal  at   seN"  \N"         if  and  only  if: 

i  \  i    ^        m(s)+c,    ,+...+Ci    . 

(ii)  ^Ww,  >  Pe+6-V.(~a)(s-il,l))  =    Max  {/3e(«  +  6  k_1  ^-W,      }, 

K  1  l<t<k  l~°  K    l 

where  m(s)sn(s)— h,  +c,  ,  and  W  =c  =0.   The  inequality  in  (ii)  states:  "i  prefers  to  wait 

at   s."  The  equality  in  (ii)  uses  the  induction  hypothesis  to  infer  that,  conditional  on  i 

producing  at  s,  i's  optimal  Kjhavior  on  N"  —     takes  the  form:   continue  to  produce  until, 
with  k— t   (possibly  zero)  units  left,  it  pays  to  wait  for  the  node  s/_.,  and  associated 

payoff  W,  __. ."   We  need  to  show  that  the  inequality  in  (ii)  holds  when  m(s)=c,  ,  so  as  to 
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k  U      1  

conclude  that   i   waits  at   serT  \N  if  and  only  if  n(s)<h,+m,,  for  some  m,el. 

In  the  proof  of  Lemma  3  we  noted  .hat  P{^)>V k> P(^fr) ,  k>l.  So  by  (i): 

(iii)  /?[!  +  |=|  k]  >  Wk  >  fll  +  ^-k],  k>2. 
Substituting  (iii)  into  (ii)  and  rearranging,  we  see  that  (ii)  will  hold  if,  for  l<t<k: 

(iv)   6  k.[i-6+6e-8  k]  >  e(l-6l)  +  6  k  k     -[1-6+6-6  k  l]. 

Recalling  the  definition  of  T,    and  using  a  bit  of  algebra  we  have  that  (ii)  holds  if: 

(v)   (5Ck.(i-^^).[l-(5Ck-1+'"+Ck-t.(T^f)]  >[e-6  k+Ck"].(l-^t))  l<t<k-l. 

c  c 

Now  by  (4.7)(ii),   8k-[l-6(l-6)Vk_]]>e  and  Vk_1<(^))  k>l;   so,  6  k{l-6+6e)>e,  k>2. 

By  Lemma  3,  c,  >1,  k>l.   Obviously,  (-j_T   "tc)<-r.   So,  if  c,    ,+...+c,  _.>t+l,  then  we 

may  conclude  that   [LHS  of  (v)]  >  e-[l-tf]  >  [RHS  of  (v)],  and  (ii)  holds. 

Suppose  ck_m=l,  l<m<t.   Then  by  (4.8),  62/(l+6+62)<e<6/(l+6),  and  ck<2.   Also, 


Wk=/?(l-£2t)/(l-<$2)+£2tWk_t,  and  Wfc_t<j9(l-^(k  ^)/(l-<52).  So  (ii)  holds  if,  for 

l<t<k: 

p  ^  p  2k t  2 

5  k[j^  ~  0~V(l-£)Wk_t]>e,  which  holds  if  6  k[j^        }>e,  which  holds  if  j^^- 
But  (4.8)  implies  that  if  t>82/(l+6),  then  ck=l,  Vk>l.   Since  Wk</?(^),  firm  i  will 
wait  at  least  one  period  to  reach  an  alternating  path.   Thus  we  have  proven  that  (ii)  holds 
for  l<m(s)<ck.   The  integer  mk  is  given  by:   mkEMax{m(s)|  (ii)  holds}— ck>0.  We  conclude 

that  i's  best  play  on  N*  is  given  by   a-,  for  {m,}  inductively  defined  in  this  manner. 

It  remains  to  show  that  firm  j(s)   cannot  gain  by  deviating  from   a  at  seF  for 

which  n(s)>hw  N+m,  /  n,  and  cr(s)=(l,l).   We  sketch  the  argument  here,  as  the  reasoning 

closely  parallels  that  employed  in  this  part  of  the  proof  of  Theorem  3.   Again,  there  are 

three  cases,  distinguished  by  the  behavior  of  a  at  the  nodes  s'  and  s' ',  respectively 

reached  if  j  does  or  does  not  follow  a- at   s.   If  <x(s')=(l,l)  then  j  will  not  deviate,  since 
that  involves  waiting  to  reach  a  node  where  it  will  produce,  when  it  could  produce  and 

reach  a  node  from  which  it  might  wait.   If  a-(s'  ')=()  then  j  will  not  deviate,  since  by  the 
definition  of  h\  ,  the  fact  that  <r(s)=(l,l)  implies  V-(<r)(s)  is  too  large  for  j  to  forebear  at 

K  j 

s  in  order  to  reach  s",  where  i  forebears.   There  remains  the  case  in  which  a-(s')—0, 

but  a(s'  ')=(1,1),  so  by  deviating  j  avoids  reaching  a  node  where  i  would  have  begun  to 
wait.   Argument  analogous  to  that  used  for  Theorem  3  disposes  of  this  case  as  well. 

The  point,  as  in  the  proof  of  Theorem  3,  is  that  j  never  gains  by  delaying  i's  wait. 

Bear  in  mind  that  i  is  waiting  to  reach  the  unique  admissible  path  in  rT  at  some  node  s/ 


40 


for  which  n(s/)=n,  — c,  .   If  i   were  to  begin  waiting  for  s/    from  a  node  s  for  which 

k(s)=k  and  n(s)=n,  — c,  +m,  then  the  payoff  to  j  as  a  consequence  of  this  waiting  by  i  is: 

P(l—8    )/(l—6)+6        Vi, ,  for  V,  _,  ai  defined  in  (4.7).   If,  however,  i   were  to  begin  at 

s— (1,1),  then  the  implied  payoff  to  j  at   s  is:   ^e+^(l-^n~1)/(l-6)+^mVk_1.   The 

latter  payoff  is  less  than  the  former  since,  by  (4.7),  <T^ V,  _•■</?( t— r)-   Thus,  if  j  deviates 

from  a  at  s,  it  would  strictly  prefer  that  i   begin  waiting  at   s' '  than  at   s' '— t - (1,1),  for 
any  t>l.   And  yet,  it  would  rather  follow  a  at  s  than  deviate  and  reach  s' '  with  i 
beginning  to  wait  there.   So  j  will  not  deviate  from  a  under  any  contingency,  and  we  may 
conclude  that  a  is  a  restricted  MPE  on   N~. 

* 

(ii)    We  now  show  that   {seS|n(s)<h,  /  \+m,  /  JcM*.   Since   aeEiTT,  Theorem  1  implies 

that  there  is  some   ueS    such  that   a=a  on  N".   By  definition  of  the  profile   a  and  the 
sequence  {m,  }  we  know  that   {s,(s,o)}cM(a),  for  all  seN  for  which  n(s)<n,  /  \+mw  \. 

So  we  will  have  proven  the  Theorem  if  we  can  show  that,  for  every  seS\N",  there  is  some 

* 

ctgE     which  generates  a  collusive  path  from  s.   We  do  so  by  constructing  an  extension  of 

a  which  has  the  property  that  it  generates  an  alternating  path  from   s   until   N~  is 

reached,  after  which  it  follows   a. 

Note  that  {st(s,ar)}n^0,VseS\K  since  (a,0)eN~  and  (0,b)eN~,  Va,b>0.   For  seS\N~, 

let   s  be  the  first  node  reached  from  s  along  {s,(s,&-)}  such  that  seN".  Then  it  must  be 

that  n(s)<h,/-N    If  k(s)>0  and   a(s)=a(s)  we  will  say  that   a  is  "compatible"  with   a 

at  s.   When   a  and   a  are  not  compatible  at   s,  then  redefine  a  so  that  firm  A 
produces  when  the  sum  of  stocks  is  odd,  instead  of  even.  With  this  redefinition  in  mind 

we  can  state:  VseS\K  there  exists  an  alternating  profile   a  which  is  compatible  with   a  in 
the  sense  that  they  both  have  the  same  firm  producing  at  the  first  node  reached  from  s 

under   a  that  lies  in  N~.   With   a  so  defined,  let  ^(s)E{st(s,a)}llN"  and  let   a[s]  be  a 

strategy  profile  defined  on  y£(s)  so  that-    j[s](s')=a(s'),  Vs'cN",  and   c-[s](s')=a(s'), 

Vs'£^(s)\N~.   In  words,  for  given  seS\N",  a[s]  is  the  profile  defined  on  the  union  of  N" 
and  the  compatible  alternating  path  from  s,  such  that  it  generates  alternation  outside  of 

N",  and  follows   a  on  N\  It  is  obvious  that    ^(s)  is  closed  to  deviation  from   cr[s],  and 

that   <r[s]   generates  a  collusive  path  from  s.   We  now  show  that   cr[s]eE  WcV 

Clearly  we  need  only  be  conerned  with  points  s'6  ^(s)\N\   The  proof  of  Theorem  2 
showed  that  it  pays  to  deviate  from  alternation  only  when  the  payoff  at  the  node  reached 

without  deviating  exceeds  0{j^)-   Neither  firm's  payoff  exceeds  this  bound  at  s.   At  any 


41 


predecessor  of  s  and  successor  of  s  along  the  path  {s.(s,a)},  each  firm's  payoff  consists  of 

receiving  /?  on  alternate  dates  until  the  node  s  is  reached,  and  hence  may  be  written: 

/?( l-<?2t)/(l-<52)+  <52tV .( cr)(i)<^( l— 62t)/(l— <?2)+ <52t/?(^^)</?(^|),  in  view  of  the  fact  that 

e<8/(l+6).   So  neither  firm  will  deviate  from   <r[s]   anywhere  on  ^(s).   Hence 

*  _    *  _ 

cr[s]eSi    wqy   By  Theorem  1,  3  aeS    such  that  ct[s]e<7  on  <s6(s),  and  so  seM*.        n 


Derivation  of  Equation  (5.2): 

We  seek  w*(y)Elim{h-mw    /,  t-,\\}.   From  the  implicit  definition  of  {m£ }  given 

in  equation  (A. 2),  and  the  properties  of  the  MPE   a*  established  in  the  proof  of  Theorem 
3,  we  conclude  that  m(s)=mf  /  \  if  and  only  if  the  following  condition  holds: 


(A.3)    ^<5m(s)+1. 


1-6 


>•  1-6 


2k(s)l  >  ihxW^  +  ^+^-tuM.MTC*? 


Kt<k 


>/M*(s)+2. 


1- 


L   1-6 


2" 


1-6' 


} 


(A.3)  states  that  firm  i(s)   does  better  tc  wait  for  3(e)  from  s  than  to  produce  (and  to 
continue  producing  until  waiting  becomes  optimal),  but  that  this  would  not  be  so  if  the 
wait  were  one  period  longer.   In  view  of  the  fact  that  h-k(s  (h,R))-*y,  as  h-»0,  we  have  that 

h-m(s)-»w*(y)  as  h-k(s)-»y  in  (A.3).   in  other  words,  (5.2)  is  obtained  from  (A.3)  by 
letting  h->0  in  the  latter,  identifying  w*(y)  with  lim{h-m(s)},  and  y  with  lim{h-k(s)}. 

The  key  to  the  derivation  is  the  observation  that  h-t->0  as  h-»0.   Assuming  this  is 
true  for  the  moment,  the  limit  of  the  RES  of  the  first  inequality  in  (A. 3)  can  be  evaluated 
using  calculus.  Introduce  the  infinitesmais  dysh-t  and  dw»h-[m(s)— m(s— t •  (1,1))]  for  hxO; 
let  h-m(s)-*w*  and  h-k(s)-*y  as  h-»0.  In  the  text  we  noted  that,  by  taking  limits  in  (4.4) 

the  waiting  time  defined  in  (4.5)  satisfies:  h-m(s)-»H — ln[pd-(e— ^)e  ry]Ew(x,y),  as  h-k(s)-y 

and  h-n(s)-tx.   Therefore,  simple  differentiation  yields  the  following: 


dv.r. 


(i)     dw  s  w(x,y)-w(x,y-dy)  s  -gjxj)  •  dy 


'  2(2e-l)e 
-l+(2e-l)e 


2ry- 

5r7 


dy  =  [l-u(y)].dy, 


where  u(y)E[l-(2e-l)e2ry]/[l+(2£-l)e:'ry].    Now,  using  the  definition  of  6(h)  and  /3(h) 
and  taking  hsO,  (A.3)  becomes: 

,..,     -rw*  rl-^~2ry1        ,    J     -rdy    -r(w*-dw)  rl-e~2r(y"dy) 
(«)  e         -[-21—    ]~fdy  +  e      *-e    K  '•[— -^ J. 

Note  that  (ii)  has  the  intuitive  interpretation  that  firm  i,  with  capacity  y,  is  indifferent 
between  waiting   w*   units  of  time  to  share  the  market,  or  producing  for  a  short  interval 
and  waiting  thereafter.   Letting  dy->0  in  (ii),  and  using  (i)  yields  the  following: 


-rw 


2ry- 


-2iyi 


(iii)  0  =  e-e-iw   .g.(i-e-^).u(y)+e-^]. 


It  is  easy  to  see  that   (iii)  implies  (5.2). 
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Notice  that  u(0)=(l-e)/e<l;  u'(y)<0;  and  u(y)=0,  for  yE-ln(2e-l)/2r.   So  there 

is  a  unique  ye(0,y)  satisfying:    [2e— u(y)]/[2— u(y)]  =  e—  r^.    For  y>y   the  value  of  w*(y) 
implied  by  (iii)  is  negative,  so  the  first  inequality  in  (A. 3)  fails  when  h  is  sufficiently 
small,  and  hence  w*(y)=0.   Finally,  we  must  verify  the  presumption  that  h- 1-»0  as  h-»0, 
upon  which  the  above  argument  is  based.   This  presumption  is  valid  if,  for  hsO  and  firm  i 
indifferent  between  producing  and  waiting,  producing  for  any  positive  interval  leaves  it  at  a 
point  where  waiting  is  preferred.   This  is  Equivalent  to  the  requirement: 

<9[w*(y)— w(x,y)]/5y  <  0  whenever  w*(y)=w(x,y). 

But  by  (iii),  w*(y)  is  decreasing,  while  by  (i),  w(x,y)  is  increasing  in  y.         o 
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